TITLE OF THE INVENTION

DELTA 6 FATTY ACID DESATURASE

CROSS-REFERENCE TO RELATED APPLICATIONS 
Not applicable.

STATEMENT REGARDING FEDERALLY-SPONSORED R&D
Not applicable.

REFERENCE TO MICROFICHE APPENDIX
Not applicable. 

FIELD OF THE INVENTION 

The present invention is directed to novel human DNA sequences
encoding a delta 6 fatty acid desaturase, an enzyme involved in the synthesis of
essential fatty acids.

BACKGROUND OF THE INVENTION 

Essential fatty acids (EFAs) are polyunsaturated fatty acids that cannot
be manufactured by mammals, yet are required for a number of important biochemical
processes, and thus must be supplied in the diet. The most important dietary EFAs
are linoleic acid and alpha-linolenic acid (ALA). These two EFAs undergo a number
of biosynthetic reactions that convert them into various other EFAs. Figure 1 depicts
the biosynthetic reactions involving the two groups of EFAs, the n-6 EFAs (linoleic
acid derivatives) and the n-3 EFAs (ALA derivatives). EFAs are formed from linoleic
acid and ALA by a series of alternating reactions involving the removal of two
hydrogens coupled with the insertion of an additional double bond (desaturation) and
the lengthening of the fatty acid chain by the addition of two carbons (chain
elongation). The enzymes catalyzing the desaturations and elongations are thought to
be the same for both groups of EFAs.

Among the more important unsaturated fatty acids are the delta 6
unsaturated fatty acids, which are involved in the maintenance of membrane structure
and function, the regulation of cholesterol synthesis and transport, and the prevention
of water loss from the skin. Delta 6 unsaturated fatty acids also serve as precursors of
the eicosanoids, including the prostaglandins and leukotrienes (Horrobin, 1992, Prog.
Lipid Res. 31:163-194). The double bond at the 6 position of delta 6 unsaturated fatty
acids is introduced by a class of enzymes known as delta 6 desaturases.

Deficiencies in linoleic acid and ALA derivatives have been associated
with skin diseases, diabetic complications, inflammatory and autoimmune disorders,
cardiovascular disorders, complications of viral infection, and retinal dysfunction.
For example, a deficiency in gamma-linolenic acid (GLA), which is produced from
linoleic acid by the action of the enzyme delta 6 desaturase, can arise from the
decreased activity of this enzyme that occurs in aging, stress, diabetes, eczema, and
some infections, or from increased catabolism of GLA due to oxidation or rapid cell
division, as occurs in inflammation or cancer. Clinical trials have demonstrated that
dietary GLA supplementation can be effective in treating a number of conditions that
are associated with GLA deficiency, e.g., atopic eczema, mastalgia, diabetic
neuropathy, viral infections, and some forms of cancer (Horrobin, 1990, Rev.
Contemp. Pharmacother. 1:1-45).

Delta 6 desaturase is an example of a fatty acid desaturase. Fatty acid
desaturases are enzymes that introduce a double bond into the carbon chain of fatty
acids. They play vital roles in the biosynthesis of polyunsaturated fatty acids,
including the essential fatty acids. Fatty acid desaturases are present in soluble and
membrane-associated forms and require electron donors (for example, cytochrome
b5) for their functioning.

Delta 6 desaturases catalyze the rate-limiting steps in the biosyntheses
of the linoleic and ALA group EFAs shown in Figure 1. End products of the linoleic
acid pathway include the eicosanoids (prostaglandins and leukotrienes). The end
product of the ALA pathway is docosahexaenoic acid (DHA), an important
component of membranes in the vertebrate retina. DHA is highly specific for retina
and represents more than 50% of the fatty acids in the rod outer segment (ROS). It
appears that DHA is important in maintaining the normal structure and function of the
retina (Anderson et al., 1992, Neurobiology of Essential Fatty Acids, Bazan et al.,
eds., Plenum Press, New York, pages 285-294). Increased dietary consumption of
DHA and its precursor, eicosapentaenoic acid, from seal meat and fish has been
linked to an increased incidence of macular degeneration in Greenland Eskimos
(Rosenberg, 1987, Arct. Med. Res. 46:64-70).

Certain delta 6 desaturases have been cloned from plants. For
example, a delta 6 desaturase has been cloned from borage (Sayanova et al., 1997,
Proc. Natl. Acad. Sci. USA 94:4211-4216). This delta 6 desaturase is unusual in that
its cytochrome b5 electron donor is present as an N-terminal extension of the enzyme
rather than being synthesized as a separate protein. The borage delta 6 desaturase has
been shown to be functional, in that transfer of the cloned gene encoding it to tobacco
results in the synthesis of high levels of GLA and octadecatetraenoic acid (OTA) in
the transgenic tobacco leaves. GLA and OTA are the products of delta 6 desaturase
activity on linoleic acid and ALA, respectively.

Based on its hydropathy profile, the borage delta 6 desaturase appears
to be a membrane-bound protein. Examination of the amino acid sequence of the
borage enzyme, as well as the amino acid sequences of membrane-bound desaturases
from a wide variety of organisms, has revealed three regions of conserved short
motifs containing histidine residues (HX(3 or 4)H, HX(2 or 3)HH, and HX(2 or
3)HH) having a conserved spacing from each other (Shanklin et al., Biochemistry,
1994, 33:12787-12794).
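
The histidine-box motifs described above can be located with a simple pattern search. The following Python sketch is illustrative only: it treats X as any residue, the function name find_motifs and the toy sequence are hypothetical, and it does not test the conserved spacing between boxes.

```python
import re

# H-X(3 or 4)-H and H-X(2 or 3)-H-H, where X is any residue.
# The lookahead wrapper lets overlapping occurrences be reported.
HIS_BOX_A = re.compile(r"(?=(H.{3,4}H))")
HIS_BOX_B = re.compile(r"(?=(H.{2,3}HH))")


def find_motifs(pattern, protein):
    """Return (1-based start position, matched text) for each occurrence."""
    return [(m.start() + 1, m.group(1)) for m in pattern.finditer(protein)]


# Hypothetical toy sequence for illustration; NOT a real desaturase.
toy = "MAAHDLGHAAAAHRLHHAAAA"

for name, pattern in (("HX(3-4)H", HIS_BOX_A), ("HX(2-3)HH", HIS_BOX_B)):
    print(name, find_motifs(pattern, toy))
```

A real analysis would run the same search on an actual protein sequence and then compare the distances between the three boxes.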

A DNA sequence has been isolated from sunflower embryos that,
judging from its sequence, appears to encode a delta 6 desaturase having a
cytochrome b5-like moiety fused to its N-terminus (Sperling et al., 1995, Eur. J.
Biochem. 232:798-805).

SUMMARY OF THE INVENTION 

The present invention is directed to novel human DNA sequences that
encode a delta 6 fatty acid desaturase, cytochrome b5-related protein (CYB5RP). The
present invention includes genomic CYB5RP DNA as well as cDNA that encodes the
CYB5RP protein. The genomic CYB5RP DNA is substantially free from other
nucleic acids and has the nucleotide sequence shown in SEQ.ID.NO.:1. The cDNA
encoding CYB5RP protein is substantially free from other nucleic acids and has the
nucleotide sequence shown in SEQ.ID.NO.:2. Also provided is CYB5RP protein
encoded by the novel DNA sequences. The CYB5RP protein is substantially free
from other proteins and has the amino acid sequence shown in SEQ.ID.NO.:3.
Methods of expressing CYB5RP protein in recombinant systems are provided. Also
provided are methods of producing delta 6 unsaturated fatty acids using DNA
encoding CYB5RP or using CYB5RP protein.

BRIEF DESCRIPTION OF THE DRAWINGS

Figure 1 depicts the enzymatic conversions involved in the linoleic
acid (n-6) and alpha-linolenic acid (n-3) pathways of essential fatty acid synthesis.

Figure 2A-G shows the genomic DNA sequence of the CYB5RP gene
(SEQ.ID.NO.:1). Underlined nucleotides in capitals represent exons. The start ATG
codon at position 544 in exon 1 and the stop TGA codon at position 18,103 in exon
12 are shown in bold. The putative polyadenylation signal ATTAAA located
approximately 20 base pairs upstream of the polyA tail is shown in bold italics
(position 18,373 in exon 12). DNA sequence upstream of exon 1 represents a
putative promoter region of the CYB5RP gene, as indicated by the presence of the
TATA box at position 353 (underlined bold).

Figure 3A-C shows the cDNA sequence (SEQ.ID.NO.:2) and the
amino acid sequence (SEQ.ID.NO.:3) of CYB5RP. The region encompassing amino
acids 1-102 represents the cytochrome b5 domain. The region encompassing amino
acids 182-186 represents HIS BOX 1. The region encompassing amino acids 219-223
represents HIS BOX 2. The region encompassing amino acids 383-387 represents
HIS BOX 3.

Figure 4 shows a portion of the cDNA sequence (SEQ.ID.NO.:4) and a
portion of the amino acid sequence (SEQ.ID.NO.:5) of mouse CYB5RP.

Figure 5A shows a Kyte-Doolittle hydropathy plot of CYB5RP.
Figure 5B shows the proposed membrane topology of CYB5RP based on its
hydropathy plot. This membrane topology is similar to that proposed for other
membrane-bound fatty acid desaturases (Shanklin et al., Biochemistry, 1994,
33:12787-12794). The amino acids shown in Figure 5B are portions of
SEQ.ID.NO.:3.

Figure 6 shows the output of the Profilescan program from the
Wisconsin GCG package. The upper amino acid sequence is from CYB5RP
(positions 31-78 of SEQ.ID.NO.:3). The lower amino acid sequence is positions 1-48
of the cytochrome b5 profile (SEQ.ID.NO.:6). The output shows that CYB5RP
contains a profile typical for the heme-binding domain of the cytochrome b5 protein
family. Importantly, the region of identity includes the invariant HPGG motif, where
histidine represents a heme axial ligand for iron.

Figure 7A and B show the results of BlastP searches of the GenBank
database using the full-length CYB5RP amino acid sequence as the query. Figure 7A
shows the hit with highest homology, a hypothetical protein from sunflower. The
sunflower protein and CYB5RP share three His boxes (boxed) in which the spacing
between the His boxes is conserved. Also boxed is the HPGG motif typical for the
heme-binding domain of the cytochrome b5 protein family. In both proteins the first
histidine of the third His box is replaced by glutamine (a typical feature of desaturases
with delta 6 specificity). The upper amino acid sequences shown are from CYB5RP
and are portions of SEQ.ID.NO.:3. The lower amino acid sequences shown are
portions of the amino acid sequence of the hypothetical protein from sunflower
(Sperling et al., 1995, Eur. J. Biochem. 232:798-805). The sequence shown as
positions 348-432 is SEQ.ID.NO.:7. The sequence shown as positions 22-74 is
SEQ.ID.NO.:8. The sequence shown as positions 152-227 is SEQ.ID.NO.:9.
Figure 7B shows the hit with the second highest homology, a delta 6 desaturase from
Borago officinalis (Sayanova et al., 1997, Proc. Natl. Acad. Sci. USA 94:4211-4216).
The Borago protein and CYB5RP also share three His boxes with conserved spacing,
as well as the HPGG motif. In both proteins the first histidine of the third His box is
replaced by glutamine (a typical feature of desaturases with delta 6 specificity). The
upper amino acid sequences shown are from CYB5RP and are portions of
SEQ.ID.NO.:3. The lower amino acid sequences shown are portions of the amino acid
sequence of the Borago delta 6 desaturase. The sequence shown as positions 338-424
is SEQ.ID.NO.:10. The sequence shown as positions 12-64 is SEQ.ID.NO.:11.
The sequence shown as positions 153-220 is SEQ.ID.NO.:12.

Figure 8 shows additional results of BlastP searches of the GenBank
database using the CYB5RP protein as the query. Figure 8 shows the amino acid
alignment between the CYB5RP protein and a delta 6 desaturase from Synechocystis
sp. (strain PCC 6803) performed by the BlastP program. The Synechocystis delta 6
desaturase and CYB5RP share three His boxes, two of which are shown in Figure 8
(boxed). In both proteins the first histidine of the third His box is replaced by
glutamine (a typical feature of desaturases with delta 6 specificity). The CYB5RP
sequence shown is a portion of SEQ.ID.NO.:3. The Synechocystis sequence shown is
SEQ.ID.NO.:13.

Figure 9A shows the expression pattern of the CYB5RP gene in 9
human tissues, as determined by RT-PCR amplification with 21 cycles. Expression is
detected in human retina, kidney, pancreas, placenta, and brain. Figure 9B shows the
results of the analogous experiments performed with 25 cycles of amplification.
Expression of the CYB5RP gene is seen in all the human tissues studied.

DETAILED DESCRIPTION OF THE INVENTION 
For the purposes of this invention:

"Substantially free from other proteins" means at least 90%, preferably
95%, more preferably 99%, and even more preferably 99.9%, free of other proteins.
Thus, a CYB5RP protein preparation that is substantially free from other proteins will
contain, as a percent of its total protein, no more than 10%, preferably no more than
5%, more preferably no more than 1%, and even more preferably no more than 0.1%,
of non-CYB5RP proteins. Whether a given CYB5RP protein preparation is
substantially free from other proteins can be determined by such conventional
techniques of assessing protein purity as, e.g., sodium dodecyl sulfate polyacrylamide
gel electrophoresis (SDS-PAGE) combined with appropriate detection methods, e.g.,
silver staining or immunoblotting.

"Substantially free from other nucleic acids" means at least 90%,
preferably 95%, more preferably 99%, and even more preferably 99.9%, free of other
nucleic acids. Thus, a CYB5RP DNA preparation that is substantially free from other
nucleic acids will contain, as a percent of its total nucleic acid, no more than 10%,
preferably no more than 5%, more preferably no more than 1%, and even more
preferably no more than 0.1%, of non-CYB5RP nucleic acids. Whether a given
CYB5RP DNA preparation is substantially free from other nucleic acids can be
determined by such conventional techniques of assessing nucleic acid purity as, e.g.,
agarose gel electrophoresis combined with appropriate staining methods, e.g.,
ethidium bromide staining, or by sequencing.

"Substantially the same biological activity as CYB5RP" means being
able to introduce a double bond into the 6 position of linoleic acid under conditions in
which CYB5RP is able to introduce a double bond into the 6 position of linoleic acid.

A "conservative amino acid substitution" refers to the replacement of
one amino acid residue by another, chemically similar, amino acid residue. Examples
of such conservative substitutions are: substitution of one hydrophobic residue
(isoleucine, leucine, valine, or methionine) for another; substitution of one polar
residue for another polar residue of the same charge (e.g., arginine for lysine;
glutamic acid for aspartic acid); substitution of one aromatic amino acid (tryptophan,
tyrosine, or phenylalanine) for another.

The present invention relates to the identification and cloning of
cytochrome b5-related protein (CYB5RP), a gene which encodes a human delta 6
fatty acid desaturase. The gene is present on PAC clones 759J12, 756B3, 519013,
and 466A11 from an area of human chromosome 11q12 that has been shown to
contain a gene related to Best's macular dystrophy (Cooper et al., 1997, Genomics
41:185-192; Stohr et al., 1997, Genome Res. 8:48-56; Graff et al., 1997, Hum. Genet.
101:263-279). This linkage between the chromosomal location of the CYB5RP gene
and the location of the gene related to Best's macular dystrophy can be used
diagnostically by identifying restriction fragment length polymorphisms (RFLPs) in
the vicinity of the CYB5RP gene, e.g., in SEQ.ID.NO.:1. Such RFLPs will be
associated with the Best's macular dystrophy gene and thus can be used to identify
individuals carrying disease-causing forms of the Best's macular dystrophy gene.

CYB5RP was identified as an EST hit in sequence scanning data from
PAC clones from human chromosome 11q12. In addition, a full length cDNA of
CYB5RP was recovered from a human retina cDNA library. The genomic region of
CYB5RP has been sequenced and the exon/intron organization of CYB5RP has been
determined. The CYB5RP gene has 12 exons. The promoter region of CYB5RP was
identified upstream of the 5' UTR by detecting consensus elements required for
eukaryotic transcription. The expression pattern of CYB5RP was determined by
RT-PCR analysis in 9 human tissues. The CYB5RP gene is expressed predominantly in
human retina, kidney, pancreas, and placenta; lower levels of expression are also
detected in brain, heart, lung, liver, and skeletal muscle. Bioinformatic analysis
revealed significant homology to a group of plant and bacterial fatty acid desaturases.
All of the typical amino acid motifs present in these fatty acid desaturases are also
present in CYB5RP. Kyte-Doolittle algorithm analysis predicts a transmembrane
organization typical of fatty acid desaturases for CYB5RP (see Figure 5). CYB5RP is
unusual in that it contains a cytochrome b5 region in its N terminus. While many
fatty acid desaturases utilize cytochrome b5 as an electron donor, most have not
incorporated this cytochrome as part of their polypeptide chain.
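
A hydropathy profile of the kind referred to above can be approximated by a sliding-window average of Kyte-Doolittle hydropathy values. The sketch below is a minimal illustration in Python. The window of 19 residues is an assumption (a common choice when looking for membrane-spanning segments), not a parameter stated in this specification, and the function name and toy sequence are hypothetical.

```python
# Kyte-Doolittle hydropathy values (Kyte and Doolittle, 1982).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(protein, window=19):
    """Return (1-based centre position, mean hydropathy) for each full window."""
    if window < 1 or window % 2 == 0:
        raise ValueError("window must be a positive odd number")
    half = window // 2
    scores = [KD[aa] for aa in protein.upper()]
    profile = []
    for centre in range(half, len(scores) - half):
        segment = scores[centre - half:centre + half + 1]
        profile.append((centre + 1, sum(segment) / window))
    return profile


# Hypothetical toy sequence: a leucine stretch flanked by charged residues.
toy = "RRRRR" + "L" * 19 + "KKKKK"
peak_position, peak_score = max(hydropathy_profile(toy), key=lambda item: item[1])
print(peak_position, round(peak_score, 2))
```

Windows whose mean hydropathy is strongly positive are candidates for membrane-spanning segments; the threshold used to call a segment is a further choice left to the analyst.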

That CYB5RP is a fatty acid desaturase is shown by the following
evidence:

(1) CYB5RP possesses significant homology to a group of plant
and microbial fatty acid desaturases;

(2) Like other fatty acid desaturases, CYB5RP has three conserved
histidine boxes, with correct spacing between the boxes; and

(3) The predicted membrane topology of CYB5RP is similar to
that of known fatty acid desaturases.

That CYB5RP is a delta 6 fatty acid desaturase is shown by the
following evidence:

(1) CYB5RP contains a cytochrome b5-like moiety fused to its
N-terminus. The only two fatty acid desaturases that contain a cytochrome b5-like moiety
fused to their N-termini are known or suspected to be delta 6 desaturases.

(2) The only two plant desaturases that are known or suspected to
introduce a double bond in the 6 position have an atypical His box 3 (QI/LEHH), with
a Q in the first position rather than an H. CYB5RP has the same atypical His Box 3.

(3) The only bacterial desaturase that is known to introduce a
double bond in the 6 position has an atypical His box 3 (QVTHH), with a Q in the
first position rather than an H. CYB5RP has the same atypical His Box 3.

CYB5RP is a target for the development of drugs for the treatment of
disorders of lipid metabolism and for the treatment of conditions that require the
modulation of the biosynthesis of prostaglandins and leukotrienes (asthma, pain, etc.).
CYB5RP is also a target for the development of drugs for use in treating skin
diseases, diabetic complications, reproductive disorders, including breast pain and
premenstrual syndrome, inflammatory and autoimmune disorders, cardiovascular
disorders, complications of viral infections, and various forms of retinal degeneration,
including age-related macular degeneration.

CYB5RP is homologous to a delta 6 desaturase from Borago officinalis
(see Figure 7B). Both CYB5RP and this Borago delta 6 desaturase, unlike
desaturases from higher plants, are unusual in containing a cytochrome b5-like
domain fused to their N-termini (Sayanova et al., 1997, Proc. Natl. Acad. Sci. USA
94:4211-4216; hereinafter "Sayanova"). The Borago desaturase has been expressed
in transgenic tobacco, resulting in high levels of delta 6 desaturated fatty acids in the
transgenic tobacco leaves, including high levels of γ-linolenic acid (GLA) (Sayanova).
Given the medical importance of GLA, Sayanova proposed that transgenic plants,
expressing the Borago delta 6 desaturase, would be valuable as sources of GLA.
Similarly, CYB5RP, expressed in transgenic plants, is expected to provide a valuable
source of GLA.

The present invention provides DNA encoding CYB5RP that is
substantially free from other nucleic acids. The present invention also provides
recombinant DNA molecules encoding CYB5RP. The present invention provides
DNA molecules substantially free from other nucleic acids comprising the nucleotide
sequence shown in Figure 2 as SEQ.ID.NO.:1. Analysis of SEQ.ID.NO.:1 revealed
that this genomic sequence defines a gene having 12 exons. These exons collectively
have an open reading frame that encodes a protein of 445 amino acids. When an
alternatively spliced exon 8 is used, a CYB5RP protein of 433 amino acids, lacking
amino acids 317-328, is produced. Thus, the present invention includes two cDNA
molecules, encoding two forms of CYB5RP protein, that are substantially free from
other nucleic acids. The first cDNA is shown in Figure 3 and has the nucleotide
sequence SEQ.ID.NO.:2. The second cDNA is identical to the first, except that it
does not contain the nucleotides at positions 1,019-1,054.
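
The correspondence between the missing amino acids (317-328) and the missing nucleotides (1,019-1,054) can be checked with simple coordinate arithmetic. The Python sketch below assumes, as this specification states elsewhere, that the coding region occupies positions 71-1,405 of SEQ.ID.NO.:2; the function names are hypothetical, and the example string passed to delete_span is a placeholder, not the actual sequence.

```python
CDS_START = 71    # first coding nucleotide in SEQ.ID.NO.:2 (from the specification)
CDS_END = 1405    # last coding nucleotide in SEQ.ID.NO.:2 (from the specification)


def codon_span(aa_position, cds_start=CDS_START):
    """Return the 1-based (first, last) nucleotide positions of one codon."""
    first = cds_start + 3 * (aa_position - 1)
    return first, first + 2


def residue_range_to_nt(first_aa, last_aa, cds_start=CDS_START):
    """Map an inclusive amino acid range to an inclusive nucleotide range."""
    return codon_span(first_aa, cds_start)[0], codon_span(last_aa, cds_start)[1]


def delete_span(seq, start, end):
    """Remove the 1-based inclusive positions start..end from seq."""
    return seq[:start - 1] + seq[end:]


nt_start, nt_end = residue_range_to_nt(317, 328)
print(nt_start, nt_end)                    # nucleotide span for residues 317-328
print((CDS_END - CDS_START + 1) // 3)      # number of codons in positions 71-1,405
print(delete_span("ABCDEFGH", 3, 5))       # placeholder string, positions 3-5 removed
```

Applied to the full cDNA, delete_span(cdna, 1019, 1054) would yield the second cDNA form; the removed span is 36 nucleotides, so the reading frame is preserved.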

The present invention includes DNA molecules substantially free from
other nucleic acids comprising the coding region of SEQ.ID.NO.:2. Accordingly, the
present invention includes DNA molecules substantially free from other nucleic acids
having a sequence comprising positions 71-1,405 of SEQ.ID.NO.:2. The present
invention also includes DNA molecules substantially free from other nucleic acids
having a sequence comprising positions 71-1,405 of SEQ.ID.NO.:2, except that the
nucleotides at positions 1,019-1,054 are missing. Also included in the present
invention are recombinant DNA molecules having a nucleotide sequence comprising
positions 71-1,405 of SEQ.ID.NO.:2 and recombinant DNA molecules having a
nucleotide sequence comprising positions 71-1,405 of SEQ.ID.NO.:2 with the
exception that positions 1,019-1,054 are missing.

The novel DNA sequences of the present invention encoding
CYB5RP, in whole or in part, can be linked with other DNA sequences, i.e., DNA
sequences to which CYB5RP is not naturally linked, to form "recombinant DNA
molecules" encoding CYB5RP. Such other sequences can include DNA sequences
that control transcription or translation such as, e.g., translation initiation sequences,
promoters for RNA polymerase II, transcription or translation termination sequences,
enhancer sequences, sequences that control replication in microorganisms, sequences
that confer antibiotic resistance, or sequences that encode a polypeptide "tag" such as,
e.g., a polyhistidine tract or the myc epitope. The novel DNA sequences of the
present invention can be inserted into vectors such as plasmids, cosmids, viral
vectors, P1 artificial chromosomes, or yeast artificial chromosomes.

Included in the present invention are DNA sequences that hybridize to
at least one of SEQ.ID.NOs.:1 or 2 under stringent conditions. By way of example,
and not limitation, a procedure using conditions of high stringency is as follows:
Prehybridization of filters containing DNA is carried out for 2 hr. to overnight at 65°C
in buffer composed of 6X SSC, 5X Denhardt's solution, and 100 µg/ml denatured
salmon sperm DNA. Filters are hybridized for 12 to 48 hrs at 65°C in
prehybridization mixture containing 100 µg/ml denatured salmon sperm DNA and
5-20 X 10^6 cpm of 32P-labeled probe. Washing of filters is done at 37°C for 1 hr in a
solution containing 2X SSC, 0.1% SDS. This is followed by a wash in 0.1X SSC,
0.1% SDS at 50°C for 45 min. before autoradiography.
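
The stringency of each step above depends largely on salt concentration and temperature. As a hedged illustration, the Python sketch below converts the SSC strengths named in the procedure into molar concentrations, using the standard composition of 1X SSC (0.15 M NaCl, 0.015 M sodium citrate). The function name and the PROTOCOL list are hypothetical conveniences, not part of the described method.

```python
# Standard composition of 1X SSC (a 20X stock is 3 M NaCl, 0.3 M sodium citrate).
NACL_1X_M = 0.15
CITRATE_1X_M = 0.015


def ssc_molarity(fold):
    """Return NaCl and sodium citrate molarity for a given SSC strength."""
    return {"NaCl_M": fold * NACL_1X_M, "citrate_M": fold * CITRATE_1X_M}


# (step, SSC strength, temperature in degrees C) as given in the procedure above.
PROTOCOL = [
    ("prehybridization and hybridization", 6.0, 65),
    ("first wash", 2.0, 37),
    ("second wash", 0.1, 50),
]

for step, fold, temp_c in PROTOCOL:
    salts = ssc_molarity(fold)
    print(f"{step}: {fold}X SSC at {temp_c} C = {salts['NaCl_M']:.3f} M NaCl")
```

Lower salt in the final wash destabilizes imperfectly matched duplexes, which is why the 0.1X SSC step contributes most to stringency.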

Other procedures using conditions of high stringency would include
either a hybridization carried out in 5X SSC, 5X Denhardt's solution, 50% formamide
at 42°C for 12 to 48 hours or a washing step carried out in 0.2X SSPE, 0.2% SDS at
65°C for 30 to 60 minutes.

Reagents mentioned in the foregoing procedures for carrying out high
stringency hybridization are well known in the art. Details of the composition of
these reagents can be found in, e.g., Sambrook, Fritsch, and Maniatis, 1989,
Molecular Cloning: A Laboratory Manual, second edition, Cold Spring Harbor
Laboratory Press. In addition to the foregoing, other conditions of high stringency
which may be used are well known in the art.

The degeneracy of the genetic code is such that, for all but two amino
acids, more than a single codon encodes a particular amino acid. This allows for the
construction of synthetic DNA that encodes the CYB5RP protein where the
nucleotide sequence of the synthetic DNA differs significantly from the nucleotide
sequence of SEQ.ID.NO.:2, but still encodes the same CYB5RP protein shown as
SEQ.ID.NO.:3. Such synthetic DNAs are intended to be within the scope of the
present invention. Also within the scope of the present invention are synthetic DNAs
that encode a CYB5RP protein lacking amino acids 317-328 of SEQ.ID.NO.:3.
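
The degeneracy referred to above can be made concrete by counting codons per amino acid in the standard genetic code. The Python sketch below builds the standard table and reports the amino acids encoded by a single codon, together with the number of distinct DNA sequences that encode a short peptide. The function name count_encodings and the example peptides are hypothetical.

```python
from collections import Counter
from itertools import product
from math import prod

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Number of codons per amino acid, ignoring stop codons ("*").
DEGENERACY = Counter(aa for aa in CODON_TABLE.values() if aa != "*")
single_codon = sorted(aa for aa, n in DEGENERACY.items() if n == 1)


def count_encodings(peptide):
    """Number of distinct DNA sequences encoding the peptide (no stop codon)."""
    return prod(DEGENERACY[aa] for aa in peptide)


print(single_codon)              # amino acids with exactly one codon
print(count_encodings("LSR"))    # three six-codon residues
```

Because the count grows multiplicatively with peptide length, a protein of several hundred residues can be encoded by an enormous number of distinct synthetic DNAs.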

Another aspect of the present invention includes host cells that have
been engineered to contain and/or express DNA sequences encoding CYB5RP
protein. Such recombinant host cells can be cultured under suitable conditions to
produce CYB5RP protein. An expression vector containing DNA encoding CYB5RP
protein can be used for expression of CYB5RP protein in a recombinant host cell.
Recombinant host cells may be prokaryotic or eukaryotic, including but not limited to,
bacteria such as E. coli, fungal cells such as yeast, mammalian cells including, but not
limited to, cell lines of human, bovine, porcine, monkey and rodent origin, plant cells
such as tobacco, and insect cells including but not limited to Drosophila and
silkworm derived cell lines. Cell lines derived from mammalian species which are
suitable for recombinant expression of CYB5RP protein and which are commercially
available, include but are not limited to, L cells L-M(TK-) (ATCC CCL 1.3), L cells
L-M (ATCC CCL 1.2), 293 (ATCC CRL 1573), Raji (ATCC CCL 86), CV-1 (ATCC
CCL 70), COS-1 (ATCC CRL 1650), COS-7 (ATCC CRL 1651), CHO-K1 (ATCC
CCL 61), 3T3 (ATCC CCL 92), NIH/3T3 (ATCC CRL 1658), HeLa (ATCC CCL 2),
C127I (ATCC CRL 1616), BS-C-1 (ATCC CCL 26) and MRC-5 (ATCC CCL 171).

A variety of mammalian expression vectors can be used to express
recombinant CYB5RP in mammalian cells. Commercially available mammalian
expression vectors which are suitable include, but are not limited to, pMC1neo
(Stratagene), pSG5 (Stratagene), pcDNAI and pcDNAIamp, pcDNA3, pcDNA3.1,
pCR3.1 (Invitrogen), EBO-pSV2-neo (ATCC 37593), pBPV-1(8-2) (ATCC 37110),
pdBPV-MMTneo(342-12) (ATCC 37224), pRSVgpt (ATCC 37199), pRSVneo
(ATCC 37198), and pSV2-dhfr (ATCC 37146). Following expression in
recombinant cells, CYB5RP can be purified by conventional techniques to a level that
is substantially free from other proteins. A description of vectors that can be used to
express CYB5RP can be found in, e.g., Goeddel, ed., 1990, Meth. Enzymol. vol. 185
or Perbal, 1988, A Practical Guide to Molecular Cloning, John Wiley and Sons, Inc.

The present invention includes CYB5RP protein substantially free
from other proteins. The amino acid sequence of the full-length CYB5RP protein is
shown in Figure 3 as SEQ.ID.NO.:3. Thus, the present invention includes CYB5RP
protein substantially free from other proteins having the amino acid sequence
SEQ.ID.NO.:3. Also included in the present invention is a CYB5RP protein that is
produced from an alternatively spliced CYB5RP mRNA where the protein has the
amino acid sequence of SEQ.ID.NO.:3 with the exception that amino acids 317-328
are missing.

As with many proteins, it is possible to modify many of the amino
acids of CYB5RP and still retain substantially the same biological activity as the
original protein. Thus, the present invention includes modified CYB5RP proteins
which have amino acid deletions, additions, or substitutions but that still retain
substantially the same biological activity as CYB5RP. It is generally accepted that
single amino acid substitutions do not usually alter the biological activity of a protein
(see, e.g., Molecular Biology of the Gene, Watson et al., 1987, Fourth Ed., The
Benjamin/Cummings Publishing Co., Inc., page 226; and Cunningham & Wells,
1989, Science 244:1081-1085). Accordingly, the present invention includes
polypeptides where one amino acid substitution has been made in SEQ.ID.NO.:3
wherein the polypeptides still retain substantially the same biological activity as
CYB5RP. The present invention also includes polypeptides where two or more
amino acid substitutions have been made in SEQ.ID.NO.:3 wherein the polypeptides
still retain substantially the same biological activity as CYB5RP. In particular, the
present invention includes embodiments where the above-described substitutions are
conservative substitutions. In particular, the present invention includes embodiments
where the above-described substitutions do not occur in the His boxes of CYB5RP.
In particular, the present invention includes embodiments where the above-described
substitutions do not occur in positions where the amino acid present in those positions
in CYB5RP is the same as the amino acid present in the corresponding position of the
sunflower protein depicted in Figure 1 of Sperling et al., 1995, Eur. J. Biochem.
232:798-805 when these two proteins are aligned by BLASTP analysis. In particular,
the present invention includes embodiments where the above-described substitutions
do not occur in positions where the amino acid present in those positions in CYB5RP
is the same as the amino acid present in the corresponding position of the

CCCTCTACCCCTGTCCCATCAGGC (SEQ.ID.NO.:15)
One of skill in the art would recognize that many other primer pairs 
based upon SEQ.ID.NO.:2 would also be suitable. 

PCR reactions can be carried out with a variety of thermostable
enzymes including but not limited to AmpliTaq, AmpliTaq Gold, or Vent polymerase.
For AmpliTaq, reactions can be carried out in 10 mM Tris-Cl, pH 8.3, 2.0 mM
MgCl2, 200 µM for each dNTP, 50 mM KCl, 0.2 µM for each primer, 10 ng of DNA
template, 0.05 units/µl of AmpliTaq. The reactions are heated at 95°C for 3 minutes
and then cycled 35 times using the cycling parameters of 95°C, 20 seconds, 62°C, 20
seconds, 72°C, 3 minutes. In addition to these conditions, a variety of suitable PCR
protocols can be found in PCR Primer: A Laboratory Manual, edited by C.W.
Dieffenbach and G.S. Dveksler, 1995, Cold Spring Harbor Laboratory Press; or PCR
Protocols: A Guide to Methods and Applications, Michael et al., eds., 1990,
Academic Press.
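
The cycling parameters above can be written down as a small program description, which also makes the total programmed time easy to compute. The Python sketch below is illustrative only; the variable and function names are hypothetical, and ramp times between temperatures, which depend on the instrument, are ignored.

```python
# Cycling parameters as stated in the text: (step name, degrees C, seconds).
INITIAL_DENATURATION_S = 3 * 60      # 95 C for 3 minutes
CYCLES = 35
CYCLE_STEPS = [
    ("denature", 95, 20),
    ("anneal", 62, 20),
    ("extend", 72, 3 * 60),
]


def programmed_seconds(initial_s, cycles, steps):
    """Total hold time in seconds, excluding instrument ramp times."""
    per_cycle = sum(seconds for _, _, seconds in steps)
    return initial_s + cycles * per_cycle


total_s = programmed_seconds(INITIAL_DENATURATION_S, CYCLES, CYCLE_STEPS)
print(f"{total_s} s ({total_s / 60:.1f} min) of programmed hold time")
```

The 3 minute extension step dominates the run time; it is sized for amplifying a product spanning the full open reading frame.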

A suitable cDNA library from which a clone encoding CYB5RP can
be isolated would be Human Retina 5'-stretch cDNA library in lambda gt10 or
lambda gt11 vectors (catalog numbers HL1143a and HL1132b, Clontech, Palo Alto,
CA). The primary clones of such a library can be subdivided into pools with each
pool containing approximately 20,000 clones and each pool can be amplified
separately.

By this method, a cDNA fragment encoding an open reading frame of
either 445 amino acids (SEQ.ID.NO.:3) or an open reading frame of 433 amino acids
(SEQ.ID.NO.:3 lacking the amino acids at positions 317-328) can be obtained. This
cDNA fragment can be cloned into a suitable cloning vector or expression vector. For
example, the fragment can be cloned into the mammalian expression vector
pcDNA3.1 (Invitrogen, San Diego, CA). CYB5RP protein can then be produced by
transferring an expression vector encoding CYB5RP or portions thereof into a
suitable host cell and growing the host cell under appropriate conditions. CYB5RP
protein can then be isolated by methods well known in the art.

As an alternative to the above-described PCR method, a cDNA clone
encoding CYB5RP can be isolated from a cDNA library using as a probe
oligonucleotides specific for CYB5RP and methods well known in the art for
screening cDNA libraries with oligonucleotide probes. Such methods are described
in, e.g., Sambrook et al., 1989, Molecular Cloning: A Laboratory Manual, Cold
Spring Harbor Laboratory, Cold Spring Harbor, New York; Glover, D.M. (ed.), 1985,
DNA Cloning: A Practical Approach, IRL Press, Ltd., Oxford, U.K., Vol. I, II.
Oligonucleotides that are specific for CYB5RP and that can be used to screen cDNA
libraries can be readily designed based upon the cDNA sequence of CYB5RP shown
in SEQ.ID.NO.:2 and can be synthesized by methods well-known in the art.

Genomic clones containing the CYB5RP gene can be obtained from
commercially available human PAC or BAC libraries available from Research
Genetics, Huntsville, AL. PAC clones containing the CYB5RP gene (e.g., PAC
clones 759J12, 756B3, 519O13, and 466A11) are commercially available from
Research Genetics, Huntsville, AL (Catalog number for individual PAC clones is
RPCI.C). Alternatively, one may prepare genomic libraries, especially in P1 artificial
chromosome vectors, from which genomic clones containing the CYB5RP gene can be
isolated, using probes based upon the CYB5RP sequences disclosed herein. Methods
of preparing such libraries are known in the art (Ioannou et al., 1994, Nature Genet.
6:84-89).

The present invention also provides oligonucleotide probes, based
upon SEQ.ID.NO.:2, that can be used to determine the level of CYB5RP RNA in a
sample. In particular, the present invention includes DNA oligonucleotides
comprising at least 18 contiguous nucleotides of SEQ.ID.NO.:2. Also provided by
the present invention are corresponding RNA oligonucleotides. The DNA or RNA
oligonucleotide probes can be packaged in kits.
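The "at least 18 contiguous nucleotides" criterion can be illustrated with a short sketch. Everything here is hypothetical: the function names are invented for illustration, `PLACEHOLDER_CDNA` is an arbitrary 24-base string standing in for SEQ.ID.NO.:2 (it is not the real sequence), and the reverse-complement helper reflects an assumption that an antisense probe may be wanted for detecting RNA, which the text does not specify.

```python
MIN_PROBE_LENGTH = 18
_COMPLEMENT = str.maketrans("ACGT", "TGCA")

# Arbitrary placeholder, NOT the CYB5RP cDNA sequence.
PLACEHOLDER_CDNA = "ATGGCCGATTACGGCTTAACCGTA"


def candidate_probes(sequence, length=MIN_PROBE_LENGTH):
    """List every contiguous window of the given length within the sequence."""
    if length < MIN_PROBE_LENGTH:
        raise ValueError("probes must contain at least 18 contiguous nucleotides")
    seq = sequence.upper()
    return [seq[i:i + length] for i in range(len(seq) - length + 1)]


def to_rna(dna):
    """Corresponding RNA oligonucleotide (T replaced by U)."""
    return dna.upper().replace("T", "U")


def reverse_complement(dna):
    """Antisense strand, read 5' to 3' (assumed use: hybridizing to RNA)."""
    return dna.upper().translate(_COMPLEMENT)[::-1]


def is_probe_of(probe, sequence):
    """True if a DNA or RNA probe is >= 18 nt and occurs contiguously in the sequence."""
    as_dna = probe.upper().replace("U", "T")
    return len(as_dna) >= MIN_PROBE_LENGTH and as_dna in sequence.upper()


if __name__ == "__main__":
    probes = candidate_probes(PLACEHOLDER_CDNA)
    print(len(probes), probes[0], to_rna(probes[0]))
```

A sequence of length n yields n - 17 candidate 18-mers; in practice candidates would be further filtered for specificity and hybridization properties by methods known in the art.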

In addition to the utilities described above, the present invention makes
possible the recombinant expression of the CYB5RP protein in various cell types. In
particular, it is advantageous to recombinantly express CYB5RP in plant cells. Such
expression in plant cells provides a method for the production of high levels of
valuable EFAs such as GLA and OTA in the recombinant plant cells. An example of
such recombinant expression of a delta 6 fatty acid desaturase, in that case from
borage, is described in Sayanova et al., 1997, Proc. Natl. Acad. Sci. USA 94:4211-
4216 (Sayanova). The recombinant expression of the borage delta 6 desaturase led to
the production of high levels of GLA and OTA in the leaves of the tobacco plants in
which it was expressed. The procedures described in Sayanova can be easily adapted
to express CYB5RP in tobacco, thus providing an additional, useful way to produce
large amounts of valuable EFAs. Known methods of recombinantly expressing genes
in other plant species besides tobacco can be used to express CYB5RP in those other
species.

The present invention also makes possible the development of assays
which measure the biological activity of the CYB5RP protein. Such assays using
recombinantly expressed CYB5RP protein are especially of interest.
Assays for CYB5RP protein activity can be used to screen libraries of
compounds or other sources of compounds to identify compounds that are activators
or inhibitors of the activity of CYB5RP protein. Such identified compounds can
serve as "leads" for the development of pharmaceuticals that can be used to modulate
the activity of CYB5RP in patients suffering from conditions where that activity is
abnormal, e.g., skin diseases, diabetic complications, inflammatory and autoimmune
disorders, cardiovascular disorders, complications of viral infection, and retinal
dysfunction such as macular degeneration.

Such assays may comprise:

(a) recombinantly expressing CYB5RP protein in a host cell;

(b) measuring the biological activity of the recombinantly
expressed CYB5RP protein in the presence and in the absence of a substance
suspected of being an activator or an inhibitor of CYB5RP protein;

where a change in the biological activity of the recombinantly
expressed CYB5RP protein in the presence as compared to the absence of the
substance indicates that the substance is an activator or an inhibitor of CYB5RP
protein.

In particular embodiments, the biological activity of the recombinantly
expressed CYB5RP protein is the ability to introduce a double bond into the 6
position of linoleic acid or alpha-linolenic acid.
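The comparison at the heart of such an assay (activity in the presence versus the absence of a test substance) can be sketched as follows. This is a hypothetical illustration, not a protocol from the specification: it assumes activity is read out as the percentage of substrate (e.g., linoleic acid) converted to its delta 6 desaturated product (e.g., GLA), the function names are invented, and the 1.5-fold cutoff is an arbitrary example threshold that would in practice be set from replicate measurements.

```python
def percent_conversion(substrate_remaining, product_formed):
    """Desaturase activity expressed as percent of substrate converted to product."""
    total = substrate_remaining + product_formed
    if total <= 0:
        raise ValueError("no fatty acid detected")
    return 100.0 * product_formed / total


def classify(activity_without, activity_with, min_fold_change=1.5):
    """Label a substance by comparing activity with and without it.

    min_fold_change is an assumed, illustrative cutoff.
    """
    if activity_without <= 0:
        raise ValueError("baseline activity must be positive")
    ratio = activity_with / activity_without
    if ratio >= min_fold_change:
        return "activator"
    if ratio <= 1.0 / min_fold_change:
        return "inhibitor"
    return "no effect detected"


if __name__ == "__main__":
    baseline = percent_conversion(substrate_remaining=80.0, product_formed=20.0)
    treated = percent_conversion(substrate_remaining=60.0, product_formed=40.0)
    print(baseline, treated, classify(baseline, treated))
```

The example quantities are made-up numbers in arbitrary units; any measure of substrate and product (for instance peak areas from a chromatographic separation) could be supplied in the same way.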

In some embodiments, it may be advantageous to insert additional
steps between steps (a) and (b). Such additional steps might include lysing the host
cell and fractionating its contents in order to partially purify the recombinantly
expressed CYB5RP, thus facilitating exposure of the recombinantly expressed
CYB5RP to the substance as well as to any substrate used in the assay.

The present invention includes activators and inhibitors identified by
the methods described herein as well as pharmaceutical compositions comprising
such activators and inhibitors. The activators and inhibitors are generally combined
with pharmaceutically acceptable carriers before use to form pharmaceutical
compositions. Examples of such carriers and methods of formulation of
pharmaceutical compositions containing activators or inhibitors and carriers can be
found in Remington's Pharmaceutical Sciences. To form a pharmaceutically
acceptable composition suitable for effective administration, such compositions will
contain an effective amount of the activator or inhibitor.

Therapeutic or prophylactic compositions are administered to an
individual in amounts sufficient to treat or prevent conditions where CYB5RP activity
is abnormal. The effective amount can vary according to a variety of factors such as
the individual's condition, weight, sex and age. Other factors include the mode of
administration. The appropriate amount can be determined by a skilled physician.

Compositions can be used alone at appropriate dosages. Alternatively, 
co-administration or sequential administration of other agents can be desirable. 

The compositions can be administered in a wide variety of therapeutic
dosage forms in conventional vehicles for administration. For example, the
compositions can be administered in such oral dosage forms as tablets, capsules (each
including timed release and sustained release formulations), pills, powders, granules,
elixirs, tinctures, solutions, suspensions, syrups and emulsions, or by injection.
Likewise, they can also be administered in intravenous (both bolus and infusion),
intraperitoneal, subcutaneous, topical with or without occlusion, or intramuscular
form, all using forms well known to those of ordinary skill in the pharmaceutical arts.

Advantageously, compositions can be administered in a single daily
dose, or the total daily dosage can be administered in divided doses of two, three or
four times daily. Furthermore, compositions can be administered in intranasal form
via topical use of suitable intranasal vehicles, or via transdermal routes, using those
forms of transdermal skin patches well known to those of ordinary skill in that art. To
be administered in the form of a transdermal delivery system, the dosage
administration will, of course, be continuous rather than intermittent throughout the
dosage regimen.

The dosage regimen utilizing the compositions is selected in
accordance with a variety of factors including type, species, age, weight, sex and
medical condition of the patient; the severity of the condition to be treated; the route
of administration; the renal, hepatic and cardiovascular function of the patient; and
the particular composition thereof employed. A physician or veterinarian of ordinary
skill can readily determine and prescribe the effective amount of the composition
required to prevent, counter or arrest the progress of the condition. Optimal precision
in achieving concentrations of composition within the range that yields efficacy
without toxicity requires a regimen based on the kinetics of the composition's
availability to target sites. This involves a consideration of the distribution,
equilibrium, and elimination of a composition.

The present invention also includes antibodies to the CYB5RP protein.
Such antibodies may be polyclonal antibodies or monoclonal antibodies. The
antibodies of the present invention are raised against the entire CYB5RP protein or
against suitable antigenic fragments of the protein that are coupled to suitable carriers,
e.g., serum albumin or keyhole limpet hemocyanin, by methods well known in the art.
Methods of identifying suitable antigenic fragments of a protein are known in the art.
See, e.g., Hopp & Woods, 1981, Proc. Natl. Acad. Sci. USA 78:3824-3828; and
Jameson & Wolf, 1988, CABIOS (Computer Applications in the Biosciences) 4:181-186.

For the production of polyclonal antibodies, CYB5RP protein or an
antigenic fragment, coupled to a suitable carrier, is injected on a periodic basis into an
appropriate non-human host animal such as, e.g., rabbits, sheep, goats, rats, or mice.
The animals are bled periodically and sera obtained are tested for the presence of
antibodies to the injected antigen. The injections can be intramuscular,
intraperitoneal, subcutaneous, and the like, and can be accompanied with adjuvant.

For the production of monoclonal antibodies, CYB5RP protein or an
antigenic fragment, coupled to a suitable carrier, is injected into an appropriate
non-human host animal as above for the production of polyclonal antibodies. In the case
of monoclonal antibodies, the animal is generally a mouse. The animal's spleen cells
are then immortalized, often by fusion with a myeloma cell, as described in Kohler &
Milstein, 1975, Nature 256:495-497. For a fuller description of the production of
monoclonal antibodies, see Antibodies: A Laboratory Manual, Harlow & Lane, eds.,
Cold Spring Harbor Laboratory Press, 1988.

Gene therapy may be used to introduce CYB5RP polypeptides into the
cells of target organs, e.g., the pigmented epithelium of the retina or other parts of the
retina. Nucleotides encoding CYB5RP polypeptides can be ligated into viral vectors
which mediate transfer of the nucleotides by infection of recipient cells. Suitable
viral vectors include retrovirus, adenovirus, adeno-associated virus, herpes virus,
vaccinia virus, and polio virus based vectors. Alternatively, nucleotides encoding
CYB5RP polypeptides can be transferred into cells for gene therapy by non-viral
techniques including receptor-mediated targeted transfer using ligand-nucleotide
conjugates, lipofection, membrane fusion, or direct microinjection. These procedures
and variations thereof are suitable for ex vivo as well as in vivo gene therapy. Gene
therapy with CYB5RP polypeptides will be particularly useful for the treatment of
diseases where it is beneficial to elevate CYB5RP activity.

The present invention is not to be limited in scope by the specific
embodiments described herein. Indeed, various modifications of the invention in
addition to those described herein will become apparent to those skilled in the art
from the foregoing description. Such modifications are intended to fall within the
scope of the appended claims.

Various publications are cited herein, the disclosures of which are
incorporated by reference in their entireties.

WHAT IS CLAIMED:

1. A recombinant DNA molecule encoding a polypeptide having
the amino acid sequence of SEQ.ID.NO.:3.

2. A recombinant DNA molecule comprising a nucleotide
sequence selected from the group consisting of:

SEQ.ID.NO.:1;
SEQ.ID.NO.:2;
SEQ.ID.NO.:2 lacking positions 1,019-1,054;
positions 71-1,405 of SEQ.ID.NO.:2; and
positions 71-1,405 of SEQ.ID.NO.:2 lacking positions 1,019-1,054.

3. A DNA molecule that hybridizes under stringent conditions to
the DNA molecule of claim 2.

4. An expression vector comprising the DNA of
claim 1.

5. A recombinant host cell comprising the DNA of claim 1.

6. A CYB5RP protein, substantially free from other proteins,
having an amino acid sequence selected from the group consisting of SEQ.ID.NO.:3
and SEQ.ID.NO.:3 lacking positions 317-328.

7. The CYB5RP protein of claim 6 containing a single amino acid
substitution.

8. The CYB5RP protein of claim 7 where the substitution is a
conservative substitution.

9. The CYB5RP protein of claim 6 containing amino acid
substitutions where the substitutions do not occur in positions where the amino acid
present in CYB5RP at those positions is also present in the corresponding position in
the delta 6 desaturase from sunflower when CYB5RP and the delta 6 desaturase from
sunflower are aligned by BLASTP analysis or where the substitutions do not occur in
positions where the amino acid present in CYB5RP at those positions is also present
in the corresponding position in the delta 6 desaturase from Synechocystis when
CYB5RP and the delta 6 desaturase from Synechocystis are aligned by BLASTP
analysis or where the substitutions do not occur in positions where the amino acid
present in CYB5RP at those positions is also present in the corresponding position in
the delta 6 desaturase from borage when CYB5RP and the delta 6 desaturase from
borage are aligned by BLASTP analysis.

10. An antibody that binds specifically to the CYB5RP protein of
claim 6.

11. A DNA or RNA oligonucleotide probe comprising at least 18
contiguous nucleotides of at least one of the sequences of claim 2.

12. A method for determining whether a substance is an activator
or an inhibitor of CYB5RP protein comprising:

(a) recombinantly expressing the CYB5RP protein of claim 6 in a
host cell;

(b) measuring the biological activity of the recombinantly
expressed CYB5RP protein in the presence and in the absence of a substance
suspected of being an activator or an inhibitor of CYB5RP protein;

where a change in the biological activity of the recombinantly
expressed CYB5RP protein in the presence as compared to the absence of the
substance indicates that the substance is an activator or an inhibitor of CYB5RP
protein.






13. The method of claim 12 where the biological activity of
CYB5RP protein is the ability to introduce a double bond into the 6 position of
linoleic acid.

14. A pharmaceutical composition comprising an activator or an
inhibitor of CYB5RP.

15. A method of treating macular degeneration comprising
administering to a patient an effective amount of the pharmaceutical composition of
claim 14.

1 gctcacagac cgggactccg cctccggttc ccgagggcgt ggcgaggcgc

51 tgcgggacgc ccaacaggtg cgtgttgtgt ccccaggccc cgcgctccgg 

101 gtggagtcaa gagcctggaa gccggcagcc cgggaaaagg gggcgggacg 

151 gtgccccggg gcagggctgg gtggcggccg ctgtcctccc gggaggggcg 

201 ggccgcctcg acgccgccct ccctggcggc caatggagac cgaggccccg 

251 cgcctggatt ggagcggacg cgggggtcag ccagccttgg gggccggggc 

301 ctggccgggg gcgggggggc aggcgaggcg aggcgggcgc cgtccgcgcg 

351 at tataaaa c ggggagttcc ctgcgccgcg agccgggagg cgcacgctcg 

401 ctcgtacggc ggccgcggcg gcagggcggg gccggagcag cgggcggcgg 

451 cggaggcggc gcccgggagc get cttcgc t TC.CCTQQQQQ TCTTQCTCQG 

501 ACCTCGGCCA CCGCCTGGGA T CCCCAGGAC TCGTGCGTGC AGCATGGGCG 

551 GCGTCGGGGA GCCGGGACCG CGG GIGGGAC CCGCGCAGCC GGGGGCGCCG 

601 CTGCCCACCT TCTGCTGGGA GCAGATCCGC GCGCA CGACC AGCCCGGCGA 

fiSt CAAGTCGCTC GTCATTGAGC GC C GCGTCTA CGACATCAGC CGCTGGGCAC 

701 AGCGGCACCC AGGGGGCAGC CGCCTCATCG GCCACCACGG CGCTGAGGAC 

751- GCCACG ataa ggaagecata aggaagecac ccaccggcgg gtggagcc.tg. 

801 gagcteggtc gtgggcgtga tgtcccgctc cacctgtggg gecttagcat 

851 cctccctccc" ctcgctgacc tttgacctcc acgccgggac ccagagttgg 

901 ggtggactag ccagggccag atgtggggta gggagggcag ttccctgcgt 

951 ggaggacccg cagctgtcca eggagcaggt ctgcggggga ggagggggee 

1001 tcagaggtgg gtgtgtcatg ctgcagagcc tgccctgggt gaggggctgc 

1051 cctgttgctc ccaggtccct gtttcagttc tgggtcccca tgctgggtgc 

1101 ttgctgagtg ctaggggtag ggcagggcag ggtccccagg ggccggtaag 

1151 gaeatgecat tagaggctgg gggctgggcc ggcctgaggt ctgtggcttt 

1201 cccaagagct tctgtaaagg gctcagggac agtgactcac ctctccgggc 

1251 tagcagctgc acgtgggagg getttgecag ccaggctggg tgggcctctc 

1301 ctggaagcac agtcacccca ggaacaggct ggcccctggg gaccccaact 

1351 tcccaatccc agcccctgtc tagacaggca gggatgtagc ctggccccag 

1401 ggtactgtct ggctggagtc cagtggtgga gcagcccgac cagccccttt 

1451 tccttagtta cccacctgca taataggggt tggggccacg atgccctgtc 

1501 cttgaccctc caaatttcta ggttggccac actgggtatc aggaaggtct 

1551 tcaagacccg aggacatgaa tectgaatge tggctttttg ggcagcagcg 

1601 gaggttctgt ccagtcccag gaetgtegge gtccctcttg ccagggccac 

1651 ctgctctctg ccgattgcca tctccagcat gttggacaat cttcactgga 

1701 ctctttgagg aagaaagece ctcttttccc tttccacccc atgaagctga 

1751 ggagtgagaa taagaatcct cctgaaattc taaaaaaaga aaaaaaaaaa 

1801 aaagagaacg ccttgtccgt ggctgttcag gcgccagacg ctggcccgag 

1851 gggacagcac agccgtggga tgaagcagee tgggggcagt atttgagegt 

1901 gcaggtgttt gcatgtctgg gtgagtgtgg tgtgtgtgcc tgcctttctg 

1951 ecagggegtg gcgaggtgag gggcaegget tctccccaaa ggccttgctg 

2001 agccctggcc tcccttcaag gagtcttgtg gatgectget ctggtctttt 

2051 tttaaaaaag tatctatttt atttattatt atttgtttaa aaatagagac 

2101 agggtctcac tatgttgctc gggctggtct caaagtcctg ggttcaagca 

2151 ttcctcctgc ctcagcctcc gaaagttctg ggattacagg catgagccac 

2201 cactcccggc ctgctctagt cttttgtaac ctagaggaca gtatggatac 

2251 agaaaacttt actccccacc aaccgccgga gacagagtct tgctctgcca 

2301 cccagactgg agtgcaatgg cgccatcttg gctcactgca acctccgcct 

2351 cccaggttca agegattetc ctgcctcagc ctcccgagta gctgggatta 

-2401 - cgggcacgcg ccaccacgcc cagcatattg^tatttttagt agagaegggg 

2451 tttcaccatg ttggccaagc tggtctcgaa ctcctgacct cgtgatccac 

2501 ccacctcggc ctcccaaagt gctgggatta caggegtgag ccaccacgcc 

2551 eggctgggat acagaaagct tttatttcat cactgtttcc tgcctggtgc 
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tttaatcctc 
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ttcagtgatt 
gctatgctgc 
tgtttgactc 
gttctcacct 
gttcacagca 
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gtttgtgtcc 
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gatccttaat 
gggaggatgg 
gagtgtggcc 
gtccagtgag 
gagagggagc 
cttctgggtc 
gtcctgggca 
catgggcaac 
caactcaggg 
ctgaagccct 
gccctctgcc 
agctggagga 
ttgcagccaa 
gtaactgatt 
gcccattttg 
aaccttctca 



ctggggttcc 

ctgagacttc 

tgttttaaaa 

agcactttgg 

gaccagcctg 

aattagccag 

ctgaggcagg 

gagatcgcgc 

tcaaacaaac 

gctctgctat 

ctgtgcccca 

gagctggctg 

gccaggtagt 

cgtgccctgc 

ctttcctggg 

gacactttag 

gagttgccag 

ctgggatggt 

aggggttggg 

atcccatccc 

gggatgtgtt 

gttctcaagt 

tctgtgccat 

ccctggggag 

ggcagcctga 

tgctggccat 

aggttttgaa 

aaggtgcccc 

atgagaaaac 

ggtctgggat 

cacctatcct 

attaaatcca 

ctgaagatgt 

gagggatcag 

tccaggaata 

cagtgaggtc 

tcggaaggac 

acagcaggga 

ggagagagtg 

agacctgccc 

agggcagaga 

agccattaga 

cagcctggcc 

tcagtttccc 
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aaagctctat 

cggatgagga 

aacaactgtg 

ttaggaccct 

gggagtggcc 

cagttattga 

gaattcgcct 

tagatgagga 

ggacccttga 
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tcccaagtgg 

catcacacag 

acatctcagg 

gaggctgagg 

accaacatgg 

gcgtggtggc 

agaatcgctt 

cattgcactc 

aaacaaaaaa 

tggaggcact 

gtttactcat 

agggcaggac 

agtttgtatt 

aaactgcagc 

tgaccttatt 

gctgtctctt 

aaatacttgg 

gcaatcagtc 

ccagcccagt 

gggcggtggc 

ccagcggtca 

ccaggcaatg 

gaagtcagcc 

agtactctgg 

tactagtgga 

aagtatataa 

atcagtactt 

tatgaggcac 

agaggctcag 

ttgaatccac 

gtttatttcc 

tcagtgagca 

agctgtgagc 

ctagagcaaa 

cagtatggct 

tagaggcggc 

tttggcttta 

agcctgggga 

aaggcagagc 

tcccccaagg 

gcaagagagg 

cacattagat 

tcgctctatg 

cattagtgtg 

gtggtgctgg 

tcatgggtgt 

gtttcccagc 

ggaaaatcca 

ccttctaggt 

agtgctgcag 

ctaggcactg 

aacaacttta 

gactgagttt 

agggtagggc 



aattactgac 

ttttctcatt 

ccgagcgctg 

tgggcagatc 

agaaaccctg 

gcatgcctgt 

gaacccagga 

cagcctgggc 

catctctctg 

gagcgacctt 
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tgtgtctcct 
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tgtggccatc 
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gatttattga 
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agaggcaaag 

agccatgttc 

ggcactcatt 

tcttctctgt 

aaaacttcta 

ggctcagagg 

gcagcagaga 

cgggctggct 

ttttaaagag 

gtctgatgga 

cttccagaag 

gaggccactc 

ctgtggacac 

ttagcttcat 

atcttggaca 

atgaggatga 

gcacctgggt 

ggtgaatgca 

agcccctggt 

agttccagca 

ggttctctga 

cagaggctgc 

ttctgagggg 

tgaggtaagt 

gaaactgggg 

ctttgtactc 
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tctcttctaa 

aatcccagca 

gacggaggtt 
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ggagagctgg 

caagcccatg 

tgctggcccc 

acagccctgc 

cctatctgaa 

aggccccaca 

ccaaagggac 

atgaacccac 

gttgagccca 

ggtggggcag 

gccccggctg 

gccaggaaat 

ttgggacaca 

aggtttgtct 

gagggatgag 

ctgagtgcct 

aaccctctct 

ttgttattgc 

tggcttgaaa 

ttaagggcat 

gattcttcaa 

gtcatgcatg 

cagggaatga 

tgggaccgtg 

gcagtggaga 

catgctggat 

gatggggagc 

catttaaaag 

ggtaagagaa 

agaagaggta 

aggcacactg 

gttgtcttta 

catcctttca 

gaatgctttt 

gcctggttac 

ttgcccacag 

gccctttcgg 

gaccccctga 

gcctggcctg 

ttcatagtaa 

tttagatgtg 

cctattgtta 

ggtgtaatgg 

gggccacgag 
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5301 ggtggggttt gtgtctgggt gggagctggg gagggacagg actaggatta 

5351 ggcagatctg aggccacagg agttggttgg ggggtggctc cagagccact 

5401 ccactccctc ctaccacatt gactgccttg aaagtcccct aatggccact 

5451 cccatgaagt gtgactgctc tgggctcccc gcaggcgttt tctgcaaggc 

5501 caccgcccac ccaggcccct tccccagagg ggctgcagtg ccttgctcct 

5551 tccttgtggg aagagttggg attgtctggc gtcagcagga tactgcccct 

5601 gggcatccct cccggtctct tcctgcgggt ttctgatgaa acagccaggc 

5651 tccagtagtg gagccagagg tcagtggtgg agagaggacc aggagccaga 

5701 gggtatagct gctttggggc tactgtgggg tcagggacac ttgtgaggcc 

5751 aagcgtcctg gctgcaggag ccctcacata tatgcccacc cttcaccagg. 

5801 acat.tgagggl.gtgctggggg acaggggtag ctttttgggg_gtgtGtgcct< 

5851 tcgacttggg^ctccgctaca. caggccaaat . ttggatgtcc catgtttaga^;. 

5901 gctgtgtttc tttgggacct cttggggcct cagtttcctc. atctgtaa^a 

5951 tgggatactg^atagtgcttc cccactggcc tcctctgacg ggcgccaggg^ 
6001 " agaggatggg acggagcatg" gtgtgctggg cacgctcctg ctgt'acccac 

6051 ccacct'ggga^gaggggagag gcaggaatgt cctgggggtg tcctttgagg 

6101. rcatagccctg" tcaccccaac atcctacaaa ggcatgagaa ggcagcgagg , 

6151 acagaccccg accacctgag ccctcagcag ccctgccaca- ctccctgctt ~ 

6201 * cacccccttc ctgactgatc tggcacattc ttgattctcc tagggagtga. 

6251 cccaaaatcc ctccctgccc tgctgtgtct ctggggtgga aggaggctgc 

6301 cagcccctcc tctctcccag cctcaggctt ggccaggact taacaggcag 

6351 gcagagaagc agcttctcca ctctcttccc tgacacctgt aggcccctcc 

6401 tgcaggcact tacctctaag tggactctca ggaggaggct catcagggct 

6451 gcagggctca gaaagagctg ggctgtggag ctcttgccaa ccgccaggcc 

6501 ccttctaagt gctttagcgc caccgactgc atcctcccag cagccttgtg 

6551 agatggggat ttgtggttcc cagtttactg atgagaaata ctgatgagag 

6601 atgggtgtgg tcttgtctgg ggctccctgg ctcctggata gcagctcagg 

6651 ttccatcctg ggcaggctgg ctctgggaca cccccccgac cagctgctgt 

6701 gtgggattca cggtggggct tgggcagggc gtgggatctt ggggccaact 

6751 gagccactct aggcttccag ggaccaaggc caggctgagc tgtctctgta 

6801 tcctgagaga gcatgaacat cacagaagat gggcccgggt tcgaatccca 

6851 gctctgccac tactaactgg gacctgggca ggggtccctt cccgctgagc 

6901 cttcatttcc tcaccagcaa aatggttcgt gcccctgctt tgggggctgt 

6951 ggagggttgg ctcttgtcta cttgttcata cctgctgttg agcagctgct 

7001 ctgtgccggc ctctgaggat gccactgtga acagagcctg tcgctacctc 

7051 caggagcttg tgtttagggg tgccgttttg attccagcac tttcacccag 

7101 ctctgctccg gtacccgatg agagacgtcg agtgccgctt tccactcgct 

7151 tgggtgcgtg tgggggttgg ggggacaggg ctttgtgcac gtagccctgg 

7201 gtggatgttc ctgggtgcac ttagggtgtg tgagggtggg acctcccaca 

7251 gttccctgag gctccactga tgaggtccaa gaaccgcctt cctgcccccc 

7301 agcccaggct cccagcagct gggcccttgg cttcttgaga tagtgactgg 

7351 cctcacggca aggacccccg cacaccacct aggagaactg ctgcttcccc 

7401 tctgttccag gagtggcgac aagcacagtt tttcgctttt gtttttgttt 

7451 tcttcacttt aagttccggg aaacgtgcag aatgtgcagg tttgttacat 

7501 aggtatacat gtgccatggt ggtttgctgc acccgtcaac ccctcatcta 

7551 ggttttaagc tccatataca ttaggcattt gtcctaatgc tctccctccc 
7601 cttgcccctc acccgcccag taagccccgg tgtgtgatgt tcccttccct 
7651 gtgtccatgt gttctcattg ttcaactctc acttatgagt gagaagagac 

^7701 .ctggactctg a.tctaacctc .ggtcaaatgg aactgtgtga ccttgaagaa 
7751 gtagcttaac ctctctga'gt cttagcttct gcctggcacc cccatcctta 
7801 aggagaggcc cacagaggac caggtcacat gacctcagcc agttccagag 
7851 aaggctgttt gcttccaggt ttcggcctga gtccaggccc ctgccctact 
7901 cgcactccct gatagcatga gaagcacagc cccagggtgc ccacccagct 
7951 ctgagagccc agcctgcttc ccagggaact gtcacagccc cacctgtccc 
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acatagtgag 
ggtggcatgt 
tcacttacga 
gcactccagc 
gtgtttttct 
tgggaggctg 
tgggcaacat 
gctgggcatg 
gtgggaggat 
tgcaccactg 
aaaaaaaaaa 
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gggggtgaga 
aagtcagccc 
ccaagctgcc 
atgtaggtgt 
tcccccaggg 
tcacttccgt 
aataggaggc 
gcaactggga 
ctactggttc 
agcctctgaa 
agtgagccga 
gagcccaggt 
ctggtctctt 
cacccccttt 



ggagccctgt 
cacacttccc 
ggcaggcttg 
gggtgaatca 
aagatcctca 
gggaagggct 
gttgcgccgg 
gtgcaaggct 
tgtttctgag 
ctgcatgtgc 
ccatcacgct 
caacatggat 
caaaggaaca 
gatctgtatg 
gagaatggag 
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tccctggccc 
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caatggcttt 
cttatcattg 
gttctgtcct 
ggaccaacgg 
gctcaataag 
gggagggagg 
gtgtgtctgt 
gtgtttggct 
tgtctgagtg 
gtgtgtgtct 
ggctcagcct 
ggaccttaag 
aacatgtgat 
gacagaaggt 
aattagtgtt 
ctggagctgg 
accactgagc 
attttactac 
taatcccaac 
agttcaacac 
tacaaaaatt 
agtaggctaa 
taagccgaga 
ctgtctcaaa 
tcacacctgt 
gaagtcaggc 
tacaaaaaaa 
ctagctgctg 
aaggctgcag 
agtgaggccc 
tggtggctca 
attgcttgag 
atctctacaa 
cctgtactaa 
cgggaggttg 
tgggcgatac 
aaaaacaccc 
gctcccctgt 
gcttcctggg 
gctgcccctc 
tacagccccc 
agaaggggca 
aaaagcaggc 
tgggtgctgg 
atgtcctggc 
accagctggt 
agtcagcttt 
ctccagaccc 
tgtatggctc 
aaaaaatcag 
aagaccacag 
agcttacttt 
ttatctttct 
tcttccttct 



ggggttctct 
caaggggtag 
cccctgtcag 
gaaaggaggc 
gtggccccag 
agattctggg 
gcgtgccagc 
gagtgttcat 
atgcctgctg 
ggggagtttc 
taaaaaggta 
gacattgtgc 
ttctcccaga 
agcatggtgg 
taatggggac 
atgatggtga 
tggacaccta 
aatttttaaa 
actttgggag 
cagcctgggc 
agcctggtgt 
ggcacaagaa 
tcatgccact 
aaataaaaga 
actctcaaca 
atttaagaca 
ttcaaaagtt 
gggaggctga 
tgagctgtga 
tgtctcaaaa 
ttcctgtaat 
cccaggagtt 
aaaataaaaa 
cagctacgag 
aggctgcagt 
agcaagaccc 
agtggggtca 
acaccagccc 
gacccccagc 
cttcaagact 
tcctctgtaa 
ccctaccagc 
aaggggaccc 
aggggcctga 
agtgacttca 
aggctcatta 
cttcaaaggt 
atggggacaa 
tggctagccc 
tccaagagct 
gcagacctgg 
acttctgggc 
cctggtctgt 
cctgcatcgt 



gacacagccc 
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agagtttcag 
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aaattggctg 
gccaaggcgg 
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ggtggcttgc 
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cccccagtcc 
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cactccagct 
ttatgatgtt 
cccaggctgg 
tcttacagtg 
atttacttta 
ttccagcact 
taaagaccag 
ggaggctgag 
gaactgagat 
tctcaaaaaa 

. agtcaaatca 
ctcaatgtgt 
cctcagttcc 
aaaaacgggg 
gtggaaactt 
tggttcaaga 
ctgcagatgt 

■-.ggatggggca- 
gggaggacca 
ttcctgtccg 
ctgtctctgc 
ctgactgtct 
TCAATTTTGT 



attacacaga 
ttcttttttg 
tcttgaactc 
ctgggattac 
aaaaaaaaat 
ttgggaggcc 
cttggccacc 
accggagaat 
catgccattg 
aaaaaaaatt 
gtttaactgt 
gtcgcccttg 
aggttttccc 
taaatcaaat 
gtatgactta 
aatcaaagtc 
ttgaccgtgt 
cagcaggcaa 
gggagggccc 
cctgtcgtct 
tggttgccgc 
gctctccttc 
GCGCAAGTTC 
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atcgcgagaa 
taaaaataga 
ctggcctcaa 
agatgtgagc 
taggctgggc 
aaggtgggca 
tggggtcagg 
tgcttgaacc 
catgccagcc 
atgttttgtg 
tcaagtgtct 
actgatcecc 
acctaccctt 
gttcgtgacc 
agctttttgg 
ttcccgggag 
tcagagaggg 
tgggtgaaaa 
atgtctttga 
gctctgccca 
tgtgctactc 
aggfrTGCCTT 



tgttggatta 

gacaaggtct 

gcaatcctcg 

caccatgcct 

gcggtggctc 

gatcaactga 

agtttgagac 

caggaggtag 

tgggcaacag 

ctcctgcttc 

tccttgcaaa 

ccgccccgtg. 

cacccactgc 

cagatcttat 

aaaagcagaa 

gtctttctgt 

gcccttgtgc 

gcaggacaac 

ctgttcatca , 

tccatccgta 

agctgtgtct 

ccgtgccttc 



ttcattttat 

cactatgtgg 

tgccttggcc 

ggcccatttt 

acacctataa 

ggtcaggagt 

cagctactcc 

aggttgcaat 

agcaagactg. 

ctgctttgta 

cccccaagga. 

acccagtggt" 

ttatgtttat" 

tctacatgca 

ccttttttcg. 

aaatccagag 

tgggtgaagt^ 

-ctggggccct 

gccggctgac * 

gtccttccgc 

gtctgtccgc 

C ATT AAG ATC 



maass agagctgqct 



llfl^l CCGG AAGAAC CCAGCCAGGA TGGACCCCTG 



11901 
11951 
12001 
12051 
12101 
12151 
12201 
12251 
12301 
12351 
12401 

12451 



gagaggctca 
tgccacatgg 
agccagggct 
gaagctgtag 
caggcactcg 
tgtccccatc 
agcgtcttac 
cagccagtgc 
tcctctttgc 
ctccaggaga 
ctctgctatg 



gcccctgagg 
ccaggagcag 
gagcctgccc 
ggatgccctg 
tttggatccc 
aaaaggagga 
ccaccccata 
tccagctcac 
ccacacccct 
atgggggtgg 
gcag£Q2£&£ 



gagggggatg 
ctccctcggc 
tcccctccca 
agaagtccag 
gaggcaagct 
ttttgatgaa 
ccttttggga 
accccgggct 
tgggcctggc 
ggaggaattt 
CTGGTCGAGG 



AAT atqagcc 
gctggagggc 
attcgcccaa 
gggggcaggc 
ggctccagat 
ccctccctgt 
ctgatttctc 
gggagaggag 
gggtactctt 
gatgggagga 
cttccttggc 

APTTCCGAGC 



agagccctag 
tgggagacat 
ggggatgcag 
agttgaaagt 
ctggtttagc 
tgtcgcccag 
tcctggctgt 
gcttcaccac 
gtcacttcat 
gcggctgggg 
tgatcggccc 

CCTGCACCAG 



GCAGCCGAGG ACATCAAGGT GTTTGATG rr AGTCCCAPCT TCTTTGCTTT 
12 501 GCTACTGGGC CACA TTrTGG CC ATGGAGGT GCTGGrrTT 7 G CTCCTTATCT 
13551 ACCTCGTGGG TCPTCflCTCG G TCCCCAGTG PCCTGGCCGT CTTCATCCTG 



^7f i 0^ GGCATCTCTC 



12651 
12701 
12751 
12801 
12851 
12901 
12951 
13001 
13051 
'13101 



aacagacgtg 
tgaatgaatg 
agtgaacagt 
ccagatgagc 
gccactgagc 
cctccacctc 
ccacaagggt 
aaaggattca 
gcttctgacc 
GACCTGGGCC 



££gtgacccc 
ggcccccatg 
gacctatgaa 
ctgaaggccc 
caagaagttc 
cagaggctgc 
cctgggactc 
cttgaaatgg 
ctggggcccc 
ctgcctctct 
-ATSCCTCCAT 



agttctgtgt 
catctgggca 
aggatgaatg 
atcaggcatg 
cttcttgaac 
tgccctgcag 
agttctcatc 
agcattagca 
aggccctggc 
ccccag££2£ 



tgcagccacc 
ttgtgaacat 
gatgaataaa 
tctgtgggtc 
agattccgat 
cttcatgaca 
tgtaaaaaga 
cgggggtacc 
gggctccgtc 
ftGTCCTGGTG 



ttaactgccc 
atttgctaaa 
cagatgaatg 
aagctgcatt 
caagcacagg 
cttacgagcc 
ggacactggc 
ctgcaagctg 
cttcccaaca 

TGTGCAGCAT 



.- TCrTOGTGGAr ACCAGGTGGC 



13151 CCAGAAGTTC GTGATGGGGC 



13201 
13251 



gccaggtgct 
gggcacagcc 



gggtggcgct 
tgccctgaga 



AGCTAAAGgt 
gggtctgccc 
gccccctcct 



gagggtgggg 
aagtgtgtgg 
cctccacagS 



tgggtggtca 
gcacagtcgg 
fiCTTCTCCGC 
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CCACTGGTGG AAC TTPP GGC ACTTCCAGCA CCACGCCAAO CCCAACATCT 

1?3 51 TCCACAAAGA CCC AGACGTG ACGGTGGCGC CCGTCTTCCT CCTQGGGGftG 

1^401 tcatccgtcg AQgtgggtgg ggagggacct ggacaacctc tggctgggcc 

13451 tgcagctgag ggggagctaa tgcactgggt ccccactctg cccctgacct 

13501 agcccctgat cfcggcctcca ctctggctgg gccaagctct gcccccgtgt 

13551 ctttccttcc cacctcccaa cctgctgggg acgaccagcc cgcttgctag 

13601 aatctagagt tgcctttgac ccttggcccc agccagcccc gtgaccttgc 

13651 ccgggagaag gaggtggcct ggagagctgc tgtctccagc cgccgcctgt 

13701 ctccacagm TGGCAAGAAG AAACGCAGAT ACCTACCCTA CAACCAGCAG 

13751 CACCTGTACT TCTTCCTGA a tgagtgtcca tctgtccttc tggggtgggg. 

13801 gagtgcctgg gcctgcactg tcctccctgc tgtcctggac cactcccagc 

13851 cacttcctgg ggcggggcac gtctgtcagg tctccctggt catggcatcc 

13901 tcccagcctc tgcagtctgt acacactctc ccagcagcat gcctttgccc 

13951 cagctgtctc ccgtgcctgg gacaccttgc agccacgggc catcacagcc 

14001 ctgctgggag cttccccaag ccccacgtag aatttcttct tgccctcact 

14051 agagtggtcc ggagccctag agtctttggg cagttgttgg ggcggacaga 

14101 gtgaggactc aagtctggcc ctgacttgcg gtgaagggtg gtgggaggtg 

14151 gtggggtaag ggcagcctgg ggaggcttgg acacagaatt gggggtgata 

14201 tggggtcatt cagctggatg tgaccagcac caacgtccca ggggcattcc 

14251 tggagtaaca gagcccctca ctctggcgcc cactcacctt ggcagcccag 

14301 ccccactcct gaacactctc atgccccttc ttgcaglEGG CCGGCCGCTG 

14351 CTTACCCTGG TGAACTTTGA AG TGG AAA A T PTGGCGTACA TGCTGGTGTG 

14401 CATGCAGTGG fiCfigtgagtg gggttgccca ggaccccggg catacggctg 

14451 ccgtggcagg aggtggtgcc tcgggggaca gtacctgccc atgaaggcaa 

14501 acagggtgca catgtgcgtg caacagtgtg gctcacatgt atgcgtgcaa 

14551 cagtgtggct cacatgtgtg cgcgcagcag gagagcgagt gtgcccgtga 

14601 ctgtacgtgt ggtggggggg ggttgaggaa cagggggggt gtgggtctct 

14651 ctcggtgagg gtgtcttccc aggaggagtt gctgggccga ctctgccagg 

14701 catctgtgtc cctggcaggg tcttccccaa cacaccctgc atgacacctt 

14751 cgtcactaaa atcagcctcg tgagctggca gggcaaggac cctgttcctt 

14801 tactcagctg agaaaaccag agagggtggt ggcctgtcct gggctctgag 

14851 gcaaatcagg cagaagggtt ggatgcctga ggtcctcctc ccacccacca 

14901 ggcctccaga cctccgggca cctggagacc tctcggtatc gcctctgccc 

14951 tcctctgcag G ATTTGCTC T GGGCCGCCAG rTTCTATGCC CGCTTCTTCT 

15001 TATCCTACCT CCCCTTCTAC GGCGTCCCTG GGGTGCTGCT CTTCTTTGTT 

15051 GCTGTCAGc rt atggcaggga gtggcgaggt cacacacagg cgacaggtga 

15101 cccccactgc agccccccac cagagcttcc cttttcccgt ctgcagaatg 

15151 gggccagtgg tactgcctcc ctggcttgct ggtggaatca cataaacaca 

15201 agcgtggcag gagcccaggg tcggtgggtt tagggagcgt ggcctggctt 

15251 gtaagtggcc cggtgggtgt cggagctgct ctggactcag cctcacagtg 

15301 gacactgctc cattcagatt ctttaaacac tggcaagggg gcgatggcca 

15351 caatcctatt gtacagataa ggaagtcaag gccacttggg gacagctgct 

15401 ctccagcctc cactcagggt gcctaagtgg tgagctggac ctagggcagt 

15451 gcccgagcct ccccacagSS ttptcgaaag rr ACTGGTTC ftTGTGGATCA 

15501 CACAGATCAA C C AGATrCCC AAGGAGATCG GCCfiCGAGM GCACCGGGAC 

15551 TGGGTCAGCT CTCAG at aaa cagcaggggt ggggcccatc ctgggtgggg 

15601 tggggggtcc cagctaggag ccagatggca aagcagggat gaggccctga 

15651 cggggctgcc aggtggggga tggtgccgtg gggtcaggga tctgcaacgg 

15701 cctcctcaca tgtgccccgc cggcttccgg cagCTGGCAG C<"ft<"CTqCAft 

15751 CGTGGAGCCC TrAPTTTTGA C C A ACTGGTT rAGCGGGCAC rTCAACTTCG 

15801 AGATCGAGCA CCAataacrtcr tgggtgctgg gggccagtgg gaggtgggga 

15851 gggggtcctg ggaggggatc ctgggagggg acccgtgggt ggggcctctc 
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15901 tctggaatct cccacttcag gtgccagcat acgctcccca cccccagCCI 

15951 CTTCCCCAGG ATGCCGAGAC ACAAC T AGAG CCGGGTGGCC CCGCTgGTCA 
16001 AGTTGGTGTG TG C C^GCAC GGCCTCAGCT ACGAAGTGAA GCCCTTCCTC 



ACCGCGCTGG TGGACATCGT 



16101 
16151 
16201 
16251 
16301 
-16351- 
16401- 
16451 
16501 
16551" 
16601 
16651" 
16701. 
16751 
16801 
16851 
16901 
16951 
17001 
17051 
17101 
17151 
17201 
17251 
17301 
17351 
17401 
17451 
17501 
17551 
17601 
17651 
17701 
17751 
17801 
17851 
17901 
17951 
18001 
18051 
18101 



ctggtggctt 
gctgagcccc 
ctcccttccc 
tgcagactca 
acccccaaag 
-aaacacttaa 
-cggtggctca 
-atcacttgag 
ccgtctctac 
■ tgggtgtggt 
aggagaatca 
; gtgcctttgc 
"aacaaaaaaa^ 
: cgccctctag 
tgcaggtggt 
tacaagagga 
cggtcatcag 
aggctctaga 
aggaaacagc 
gcaaaagaac 
gggcggccag 
tattgacagg 
gccacaggga 
gaatgccaag 
gagaagggac 
tgactggcgg 
gccatccagg 
gtggggaaca 
acgctgttcc 
tgcccacttc 
ccatttggca 
tgttgacatt 
gcaggttcca 
agcaatggag 
tggacttggc 
ccggtctaga 
caccctccag 
gaatggggat 
atcttgcaac 
cagGTQCCTG 
AGTQAAGGCA 



ccccagggcc 
cagggtccct 
cagccttcac 
caaacagcag 
gctgagggaa 
cagggactca 
cgcctgtaat 
gtcaggagtt 
taaaaaaaaa 
ggcaggtgcc:. 
cttgaactcg 
agtccagcct 
crcafcgtctca - 
aaagaaccct 
tcttatgaac 
ggttatctat 
acagttttgc 
gagcttcctg 
aagcaattaa 
aatgaaaagc 
gcaggggcgg 
ctgacgtgtg 
catctggcaa 
ctcatggcat 
acgcccggct 
gcaggagtcg 
cgaggggtga 
gaggccagat 
ccttacttgg 
tccttcatga 
cagcccttta 
tctggctcca 
cgtcctctct 
aactgggtgc 
cttggctgcc 
ttccattcca 
cccgcatcgc 
ctccgctcac 
tgagctccta 
AAGAAGTCTG 



CAG ataaqgc 
tatgcctacc 
gagcctttct 
gcacacagtg 
tgcctgcggt 
aaaagctaac 
taaacagaag 
tccagcactt 
cgagaccagc 
aaaaaaaaac 
cataatccca 



caggggcaga 
gggcaacaga 
ggcagccaag ■' 
ctatatagca 
ctggtgacca 
gccatgagct 
ttgaagatgg 
ccctcttgga 
aataacaaat 
ccctcagggt 
cagtttctaa 
agcagggaca 
agagcgttca 
agatagccga 
tgcatcttgg 
aagtggaaaa 
tggggctcag 
tccaggtctt 
ttgcccttcc 
tctctcccag 
gagcgcctgg 
ctgcccaata 
gtccttgggt 
aggagggaca 
ctccagctcc 
gagaatgagc 
tgcctgcccc 
acttaccatg 
acacccaccc 
GTGACATCTG 



tgcagcccgg 
cttgtccagg 
gtccacgtcc 
agaatttctg 
gagcaggtct 
agatccagtt 
ccatgtctca. 
ggggaggctg 
ctggccaaca 
aaaacaaaac 
gctacttggg 
ggttgcagtg 
gcaagactbt 
_ag.ttgggaca., 
agcttttagg 
ctggaggtta 
tggcattcag 
cattgccctt 
gctgatgttc 
aagtacatta 
ggggacaggg 
ataggtggta 
gggaggaggg 
ggcagagggc 
ggcaggcatg 
aaagctgccc 
ggagagcaga 
cccttgtggt 
atacctctgc 
ttcctgtgct 
cctgatgctc 
cacagggctt 
tcaggcccaa 
tgcagcgccc 
ggcccaccca 
cctacccgac 
attcagctgt 
cagggaaggg 
ggggatacag 
ccactgccac 

GGTGGACGCC 



cccctctgtt 
tcagcctcat 
catgcccttc 
gagcacctac 
atgcaaacctr 
tctcagaagg 
-gggccgggtg_ 
aggtgggcgg. 
tggtgaaacc.-.- 
aaaaattagc;; : 
aggctgaggg^- 
agctgagatt 
ctcaaaaaca ~ 
±,cccctcaca 
gtgaacccca 
gataagcgtc 
ggtcaagcat 
gtagcaatgc 
cttccagcaa 
cagaagatgg 
gaggggaggg 
gggtgggcag 
gagaggtctc 
acttgaccct 
caggcactca 
ctactgggaa 
ggacactgca 
caccttggag 
gcctttgtac 
ggtgttcaga 
tgagcccctg 
cctagcagat 
gatcgggtgg 
agcaggaggc 
ggctcatgcc 
acccgtcacc 
tctcccaacc 
aacccacagg 
gggtgttagg 
ccccacctcc 
TACCTCCATC 



ACACrCAGGC GGGP AG AG AA GGGGTTAGGG CACCAGCAAC 



CAAGCCAGCC CCCGGCGGGA T C GATACCCC CACCCCTCCA CTCiPCCAGCC 



182 01 TGGGGGTCGC CTGGCTGCCC T GGTGGTACT GTTGTCTTCC CCTCggCCCC 



19301 



.PTTACATGTG TATTGAGCAG PPPTATGGCC TTGGCTCTGG GCCTQ ATGGQ _ 
AGAGGGGTAG AGGGAAGGTG AGP ATAGCAC ftTTTTCCTAg AQCGAgAATT 

ATATTAAAA T APATTCAGAT GTATTATQGA 



IftlSl GGGG GAAAGG TCTTATTTTT 



1B4Q1 GT 
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• • • • 

1 CTTCGCTTCCCTCGGGGTCTTGCTCGGACCTCGGCCACCGCCTGGGATCC 50 

51 CCAGGACTCGTGCGTGCAGCATGGGCGGCGTCGGGGAGCCGGGACCGCGG 100 
1 MGGVGEPGPR 10 

101 GAGGGACCCGCGCAGCCGGGGGCACCGCTGCCCACCTTCTGCTGGGAGCA 150 
11 EGPAQPGAPLPTFCWEQ 27 

151 GATCCGCGCGCACGACCAGCCCGGCGACAAGTGGCTGGTCATCGAGCGCC 200 
28 IRAHDQPGDKWLVIERR 44 

201 GCGTCTACGACATCAGCCGCTGGGCACAGCGGCACCCAGGGGGCAGCCGC 250 
45 VYDISRWAQRHPGGSR 60 

251 CTCATCGGCCACCACGGCGCTGAGGACGCCACGGATGCCTTCCGTGCCTT 300 
61 LIGHHGAEDATDAFRAF 77 

301 CCATCAAGATCTCAATTTTGTGCGCAAGTTCCTACAGCCCCTGTTGATTG 350 
78 HQDLNFVRKFLQPLLIG 94 

351 GAGAGCTGGCTCCGGAAGAACCCAGCCAGGATGGACCCCTGAATGCGCAG 400 
95 ELAPEEPSQDGPLNAQ 110 

401 CTGGTCGAGGACTTCCGAGCCCTGCACCAGGCAGCCGAGGACATGAAGCT 450 
111 LVEDFRALHQAAEDMKL 127 

451 GTTTGATGCCAGTCCCACCTTCTTTGCTTTCCTACTGGGCCACATCCTGG 500 
128 FDASPTFFAFLLGHILA 144 

501 CCATGGAGGTGCTGGCCTGGCTCCTTATCTACCTCCTGGGTCCTGGCTGG 550 
145 MEVLAWLLIYLLGPGW 160 

551 GTGCCCAGTGCCCTGGCCGCCTTCATCCTGGCCATCTCTCAGGCTCAGTC 600 
161 VPSALAAFILAISQAQS 177 

601 CTGGTGTCTGCAGCATGACCTGGGCCATGCCTCCATCTTCAAGAAGTCCT 650 
178 WCLQHDLGHASIFKKSW 194 

651 GGTGGAACCACGTGGCCCAGAAGTTCGTGATGGGGCAGCTAAAGGGCTTC 700 
195 WNHVAQKFVMGQLKGF 210 

FIG.3A 
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701 TCCGCCCAC^TGGAACTTCCGCCACTTCCAGCACCACGCCAAGCCCAA 750 
211 SAHWWNFRHFQHHAKPN 227 

751 CATCTTCCACAAAGACCCAGACGTGACGGTGGCGCCCGTCTTCCTCCTGG 800 
228 IFHKDPDVTVAPVFLLG 244 



801 GGGAGTCATCCGTCGAGTATGGCAAGAAGAAACGCAGATACCTACCCTAC 850 
245 ESSVEYGKKKRRYLPY 260 

• • - •; :' "? '. ' / . ; ."' .' / 

900. 
- 277' 



851... AACCAGCAG-. „ _ . 

261 N Q Q H L Y F F. L : . I- G P P L L T L 



901 GGTGAACTTTGAAGTGGAAAATCTGGCGTACATGCTCGTGTGGATC 950:::. 

278 --V-cN -F ' ..E. -y^¥ .N. L. A Y- M; L V .C M Q W 294.., 

951 GGGCGGATTTCCTCTCGGCCGCCAGCTTCTATGCCCGCTTCTTCTTATCC 1000 

295 ADLLWAASFYARFFLS 310 

. . • 

1001 TACCTCCCCTTCTACGGCGTCCCTGGGGTGCTGCTCTTCTTTGTTGCTGT 1050 

311 YLPFYGVPGVLLFFVAV 327 

1051 CAGGGTCCTCGAAAGCCACTCGTTCGTGTGGATCACACAGATGAACCACA 1100 

328 RVLESHWFVWITQMNHI 344 



1101 TCCCCAAGGAGATCGGCCACGAGAAGCACCGGGACTGGGTCAGCTCTCAG 1150 
345 PKEIGHEKHRDWVSSQ 360 



1151 CTCGCAGCCACCTGCAACGTCGAGCCCTCACTTTTCACCAACTGGTTCAG 1200 

361 LAATCNVEPSLFTNWFS 377 

1201 CGGGCACCTCAACTTCCAGATCGAGCACCACCTCTTCCCCAGGATGCCGA 1250 

378 GHLNFQIEHHLFPRMPR 394 

1251 GACACAACTACAGCCGGGTGGCCCCGCTGGTCAAGTCGCTGTGTGCCAAG 1300 

395 HNYSRVAPLVKSLCAK 410 



1301 CACGGCCTCAGCTACGAAGTCAAGCCCTTCCTCACCGCGCTGGTGGACAT 1350 

411 HGLSYEVKPFLTALVDI 427 



1351 CGTCAGGTCCCTGAAGAAGTCTGGTGACATCTGGCTGGACGCCTACCTCC 1400 
428 VRSLKKSGDIWLDAYLH 444 

FIG.3B 
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1401 ATCAGTGAAGGCAACACCCAGGCGGGCAGAGAAGGGCTCAGGGCACCAGC 1450 
445 Q 

1451 AACCAAGCCAGCCCCCGGCGGGATCGATACCCCCACCCCTCCACTGGCCA 1500 

1501 GCCTGGGGGTCCACTGCCTGCCCTCCTGGTACTGTTGTCTTCCCCTCGGC 1550 

1551 CCCCTCACATCTGTATTCAGCAGCCCTATGGCCTTGGCTCTGGGCCTGAT 1600 

1601 GGGACAGGGGTAGAGGGAAGGTGAGCATAGCACATTTTCCTAGAGCGAGA 1650 

1651 ATTGGGGGAAAGCTGTTATTTTTATATTAAAATACATTCAGATGTAAAAA 1700 



FIG.3C 
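
The amino acid rows of FIG. 3 sit beneath the nucleotide rows in the reading frame that opens with the ATG at nucleotide 71. The following is a minimal sketch of that codon-by-codon reading, assuming the standard genetic code; the names `CODON_TABLE` and `translate` are illustrative and are not part of the disclosure.

```python
# Standard genetic code laid out in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map each of the 64 codons to its one-letter amino acid ('*' marks a stop).
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# Nucleotides 51-100 as printed in FIG. 3A; the ATG begins at nucleotide 71,
# which is offset 20 within this row.
row_51_100 = "CCAGGACTCGTGCGTGCAGCATGGGCGGCGTCGGGGAGCCGGGACCGCGG"
first_ten_residues = translate(row_51_100[20:])
print(first_ten_residues)
```

Run on the row for nucleotides 51-100, the sketch returns the ten residues printed beneath that row in FIG. 3A.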
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CGGGGA 

M - G G V G E P G G G . L G P 



1 GTACAGCGGCAATGGGCGGTCTCGGGGAGCCCGGAGGGGGACTC •"'-•50 



51 - CGGGAGGGGCCCGCACCGG TCGGGGCGGCCCTACCCATGTTCCGCTGGGA V " 100 
14 -R T 'E " G ■ .P A -P L G A P L P I'F-; R W , ^ -,30 



101 GCAGATCCGCCAGCATGACCTACCAGGCGACAAGTGGCTGGTCATCGAGC 150 
31 QIRQHDLPGDKWLVIER 47 

151 GCCGTGTCTACGACATCAGCCGCTGGGCACAGCGGCACCCAGGGGGTAGC 200 
48 RVYDISRWAQRHPGGS 63 

201 CGCATCATCGGCCACCACGG 220 
64 RIIGHH 69 



FIG.4 
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PROFILESCAN of : CYB5rp_correct_protein check: 5714 from: 1 to: 445 

GETSEQ from bmd, December 2, 1997 14:20. 

Compare to profile library: GenRunData:profilescan.fil 



Profile: profiledir:cytochrome_b5.prf 
Gap weight: 4.50 Gap Length weight: 0.05 

Ave match: 0.27 Ave mismatch : -0.21 
(Peptide) PROFILEMAKE v4.40 of: 0191.Msf2{*} Length: 48 

Sequences: 24 MaxScore: 27.58 December 2, 1992 00:07 
This profile is derived from PROSITE release 10.0 and has been tested 
by a database search against SWISS-PROT release 26.0. A comparison 
of the SWISS-PROT annotation and the results of the database search follows 
For further information about this motif, consult the . . . 

Profile: profiledir:cytochrome_b5.prf alignment: 1 

Quality: 20.77 Gaps: 0 
Ratio: 0.43 Length: 48 
Normalized quality: 2.91 

S 31 HDQPGDKWLVIERRVYDISRWAQRHPGGSRLIGHHGAEDATDAFRAFH 78 

|: Nil- .|||:::| • UN- I ll-MM ::| o 
P 1 HNDGEETWLWNGQVYDITKFLEEHPGGPDVIMEAAGTDATEEFEAIH 48 

****************************************************** 
* Cytochrome b5 family, heme-binding domain signature * 
****************************************************** 
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{g) pir:s68358 hypothetical protein - common sunflower 
Length = 458 

Score = 169 (79.4 bits), Expect = 2.8e-42, Sum P(4) = 2.8e-42 
Identities = 31/85 (36%), Positives = 49/85 (57%) His box 3 



Query: 
Sbjct: 
Query: 
Sbjct: 



348 IGHEKHRDWSSQLMTCNVEPSLFTNWFSGHL^ 407 

4G K +W Q T++S + +WFGL FQ+EHH JPR+PR + ++P+ + L 
348 VGPPKGDNWFEKQTRGT I DIACSSVyMDWFFGGLQF|Q!JHH|.FPRLPRCHLRS I SP ICREL 407 



408 CAKHGLSYEVKPFLTALVDIVRSLK 432 

C K+ L Y F A V +++L+ 
408 CKKYNLPYVSLSFYDANVTTLKTLR 432 



Score = 133 (62.5 bits), Expect = 2.8e-42, Sum P(4) = 2.8e-42 

Identities = 21/53 (39%), Positives = 35/53 (66%) 

HPGG motif 



Query: 26 EQ I RAHDQPGDKWL V I ERRVYD I SRWAQffPGQSRL I GHHGAEDATDAFRAFH 78 

++++ H+ P 0 m- I +VY+-H- WA+ HPGGl + + +D TDAF AFH 
Sbjct: 22 KELKKHNNPNDLWISILGKVYNVTEWAKEjHPGgDAPL INLAGQDVTDAF IAFH 74 



Score = 118 (55.5 bits), Expect = 2.8e-42, Sum P(4) = 2.8e-42 

Identities = 25/76 (32%), Positives = 34/76 

His box 1 



His box 2 



Query: 
Sbjct: 
Query: 
Sbjct: 



165 LAAF I LA I SQAQSWCLQpDLGHKS 1 FKKSWWNHVAQKF VK^QLKGFSAHWWNFIrFFQ^ 224 

L+ IL ++ Q L HD GH + WN A F+ +- G S WW 4 H HH 
152 LSGA I LGLAWMQ I AYLG(HDAGhffQMMATRGWNKFAG I F IGNC I TG I S I AWWKV0>jNAH}l I 211 



225 KPNIFHKDPDVTVAPV 240 

N 0PD+ P+ 
212 ACNSLDYDPDLQHLPM 227 



Score = 34 (16.0 bits), Expect = 2.8e-42, Sum P(4) = 2.8e-42 
Identities = 7/14 (50%), Positives = 9/14 (64%) 



FIG.7A 
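
The percentages printed beside the Identities and Positives counts in these alignment headers are consistent with integer truncation of the ratio rather than rounding (for example, 49/85 is shown as 57%). A minimal sketch of that arithmetic follows; the helper name `blast_percent` is an illustrative assumption and is not taken from the BLAST program itself.

```python
def blast_percent(matches: int, aligned_length: int) -> int:
    """Return matches/aligned_length as a whole percentage, truncated."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return (100 * matches) // aligned_length


# Counts copied from the FIG. 7A segment headers.
identities = blast_percent(31, 85)
positives = blast_percent(49, 85)
print(identities, positives)
```

Applied to the 34/76 Positives count in the His box 1 / His box 2 segment, where no percentage is legible, the same arithmetic gives 44.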
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Q3 q p:bou79010 1 PID: q 2Q62403 Borago officinalis delta 6 desaturase mRNA, 
complete cds. (gb:U79010) (NIP: 2062402) 
Length = 448 

Score = 179 (84.1 bits), Expect = 2.3e-42, Sum P(3) = 2.3e-42 

Identities = 34/87 (39%), Positives = 48/87 (55%) 

His box 3 

Query ■ 348"I(^ 407 
4G K +W Q T +f + 4-WF G L FQIEHHLFP+MPR N +++P V L 

"Sbjct: 338 -VGKPKGNNIIFEKQTOTLD1S 

Query: 408 CAKHGLSYEVKPFLTALVDIVRSLKKS 434 

C KH L Y FA +R+L+ + 
Sbjct: 398 CKKHNLPYNYASFSKANEMTLRTLRNT 424 



Score = 144 (67.7 bits), Expect = 2.3e-42, Sum P(3) = 2.3e-42 

Identities = 23/53 (43%), Positives = 36/53 (67%) 

HPGG MOTIF 



Query: 26 EQ I RAHDQPGDKWLV I ERRVYD I SRWAQRpPGGfeRL I GHHGAEDATDAFRAFH 78 

++++ HD+PGD W+ 1+ + YD+S W + HPGGS + ++ TDAF AFH 
Sbjct: 12 DELKNHDKPGDLW I S IQGKAYDVSDMD|HPG^SFPLKSLAGQEVTDAF VAFH 64 



Score = 105 (49.3 bits), Expect = 2.3e-42, Sum P(3) = 2.3e-42 
Identities = 22/68 (32%), Positives = 28/68 (41%) 

His box 1 His box 2 



Query: 176 QSWCLCpDLGHKS I FKKSWWNHVAQKF VMJQLKGFSAHMNFRpFQHf^KPN I FHKDPDV 235 

OS + "HD GH + S N F LGSWW + HHH N DPD+ 
Sbjct: 153 QSGW I GHDAGHY MVVSDSRLNKFMG 1 FAANCLSG I S I GVMKWNjHNAHHjl ACNSLEYDPDL 212 



Query: 236 TVAPVFLL 243 
P ++ 

Sbjct: 213 QVIPFLVV 220 



FIG.7B 
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B3p_ins35157 Delta(6)-desaturase - Synechocystis sp. 
Length = 359 

Score = 126 (59.2 bits), Expect = 9.0e-09, Sum P(2) = 9.0e-09 
Identities = 21/54 (38%), Positives = 33/54 (61%) 

His box 3 

Query 372 FTNWFSGHLNFplE^LFPRMPRHNYSRVAPLVKSLCAKHGLSYEVKPFLTALV 425 

F NMF G LN b+ HCFP + +Y ++ ++K +C + G+ Y+V P A + 
Sbjct: 292 FWNWFCGGLNHQVThflLFPN I CH I HYPQLEN I 1 KDVCQEFG VEYKVYPTFKAAI 345 

Score = 36 (16.9 bits), Expect = 9.0e-09, Sum P(2) = 9.0e-09 

Identities = 6/15 (40%), Positives = 8/15 (53%) 

His box 2 

Query: 209 GFSAHWWNFRffQHHl 223 

G S+ W +R4 H 
Sbjct: 113 GLSSFLWRYR HNYLHl 127 



FIG.8 
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1 2 3 4 5 6 7 8 9 




1. Heart 6. Skeletal Muscle 

2. Brain 7. Kidney 

3. Placenta 8. Pancreas 

4. Lung 9. Retina 

5. Liver 
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1 2 3 4 5 6 7 8 9 L PCR Marker 

1. Heart 6. Skeletal Muscle 

2. Brain 7. Kidney 

3. Placenta 8. Pancreas 

4. Lung 9. Retina 

5. Liver 
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